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(57) Abstract 

The present invention includes methods for rational drug design. 
One exemplary method disclosed herein teaches the preparation of a 
highly specific affector of a first enzyme when the first enzyme is a mem- 
ber of a class of enzymes having a conserved catalytic core with a plurali- 
ty of invariant residues. This method comprises identifying a second en- 
zyme that is a member of that class of enzymes and has a known affector. 
The affector can be an inhibitor or activator of the second enzyme. In the 
practise of the method, a first complex is formed between the second en- 
zyme and the known affector and data is obtained regarding the three-di- 
mensional coordinates of the invariant residues in the complex. These 
coordinates are used to form a template. A model is then generated in 
which the first enzyme is in a conformation with the invariant residues in 
substantially the same conformation as in the template. Changes in the 
variable residues of the catalytic core of the first enzyme are compared to 
the variable residues in the catalytic core of the second enzyme. The sec- 
ond enzyme is modified to include these non-conserved changes and an 
affector is designed using computer modelling that will induce a confor- 
mation on the catalytic core of the first enzyme in which the invariant res- 
idues are in substantially the same coordinates as the coordinates of the 
template when the first enzyme is formed as a second complex with the 
newly designed affector. The designed affector can be further refined to 
provide improved affector activity. 




JSDOCID: <WO 3302209A1 > 



4 



1 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify State* party to the PCI' on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Auatrij 


Ft 


Hnlaml 


Mt 


Mali 


AU 


A uMr.il u 


FK 


France 


MM 


Mongolia 


BB 


BarhaJtn 


GA 


tJaboii 


MR 


Mauritania 


BC 


Belgium 


GB 


Unite*! Kingdom 


MW 


Malawi 


BF 


Burkina ha*o 


CN 


Guinea 


fit 


Netherlands 


BC 


Bulgaria 


GR 


Greece 


NO 


Norway 


BJ 


Bcttnt 


HU 


Hungary 


PL 


Poland 


BR 


Bray (I 


IE 


Ireland 


AO 


Human iu 


CA 


Canada 


IT 


It illy 


RU 


Russian Federation 


CF 


( ctitr^l Airmail Republic 


J? 


Japan 


50 


Sudan 


CC 


Congo 


KH 


Ocmocrulk Peopled Republic 


SE 


Sweden 


CH 


bwtvvtUttiti 




or Korea 


SN 


Senegal 




C*6tC U'lvotfe 


Kk 


Republic of Korea 


SU 


Soviet. Union 


CM 


Cameroon 


Lt 


liucltlcrfetuin 


TO 


Chad 


cs 


Chechoslovakia 


LK 


in luidLa 


TC 


Togo 


oe 


Germany 


LU 


l.ufcembourg 


US 


United Stale* of America 


UK 


Denmark 


MC 


Monaco 






£& 


Spain 


MC 


Madagascar 







BNSDOC1D- <VVO 9302?09A1> 



WO 93/02209 



O PCT/US92/06I37 



METHODS OF DESIGNING SPECIFIC AFFECTORS USING THREE DIMENSIONAL 
CONFORMATION OF ENZYME/AFFECTOR COMPLEX 
Field of the Invention 
The present invention relates to rational design of specific affectors for a given 
5 enzyme using data obtained regarding the three dimensional conformation of an 

enzyme/affector complex. More particularly, it relates to such methods wherein the 
conformation of the conserved catalytic core of a given enzyme class is elucidated and highly 
specific affector molecules for a particular member of that class are designed. 

Background of the Invention 
10 Drug design based on an analysis of the structural features of a molecule is still in 

its infancy. At present, an analysis of X-ray crystallographic data at best permits the design 
of broadly acting affector molecules. While these affector molecules can be further refined 
to impart some selectivity, affector design does not produce molecules having the fine tuned 
specificity of, for example, an antibody for its antigen. This level of selectivity control is not 
15 always necessary; however, therapeutic regimes directed to the control of enzymes involved 

in certain cancers, genetic disorders, and infectious agents will require this type of 
selectivity. 

Enzymes can be classified into broad families or classes having similar activities, with 
each enzyme having a specific function. For example, many proteins phosphorylate their 
2 0 substrate. These enzymes are broadly labelled as kinases. A myriad of kinases exist for a 

myriad of functions. Within this broad group, kinases can be subgrouped based on 
similarities in substrate, requirements for additional cofactors or similar amino acid residues 
that are targets for phosphorylation. 

Within any given cell, there may be many active members of a given enzyme family. 

2 5 If one member of the family shows aberrant activity, then it may be therapeutically 

advantageous to alter the activity of this single enzyme to the exclusion of other similar or 
related enzymes. Such is the case for the protein kinase family where aberrant 
phosphorylation events can be associated with abnormal cell growth and regulation. This 
is observed in proto-oncogene related cancers. For example, the ppeo 0 '** 0 protein, needs 

3 0 to be controlled to the exclusion of other protein kinases in order to maintain normal cell 

metabolism. 

Protein phosphorylation as a mechanism for regulating protein activity was first 
recognized in 1955 with glycogen phosphorylase. Protein phosphorylation and 
dephosphorylation is widespread and impacts nearly all aspects of growth and homeostasis 



BMSDOC»D<VVO 93022O9A1> 



WO 93/02209 



-2- 

in the eukaryotic cell. Protein kinases catalyze the transfer of the Y-phosphate of MgATP 
to a protein substrate. The protein kinases, constitute a large and veiy diverse family of 
enzymes. Although these enzymes differ in size, substrate specificity, mechanism of 
activation, subunit composition, and subcellular localization, all, nevertheless, share a 
5 homologous catalytic core that has been conserved throughout evolution. 

It is not yet possible to regulate a given enzyme at will. While there are hundreds 
of different protein kinases, only a few of these can be readily purified. Moreover, even 
among those enzymes that can be purified, many cannot be used for X-ray crystallographic 
studies. The sequences of many enzymes have been cloned and expressed; however, not all 
10 of these are chemically active. Therefore, even if a molecule that cannot be readily purified 

is cloned and expressed, it may not be functional and thus, would not provide an adequate 
model for structural studies. Further, even if a recombinant protein is functional, it may not 
be readily crystallizable. These and other roadblocks have heretofore prevented the design 
or identification of affector molecules directed to a particular enzyme. Thus, heretofore it 
15 has not been possible to provide a method for the design of affector molecules for a given 

member of an enzyme family. 

Brief Description of the Figures 
Figure 1 illustrates the general topological arrangement of the catalytic subunits from 
a number of different protein kinases. 
!0 Figure 2 diagrams the placement of the catalytic region within various members of 

the protein kinase family. 

Figure 3 is a stereo view of the electron density for the structure determination. 
Figure 3A provides the density calculated to 2.7 A. Figure 3B provides the density calculated 
with 10.0 to 2.7 A refined model phases. 

5 Figure 4 is a stereo view of the C-a backbone and includes twenty residues of 

PKI(5-24). 

Figure 5 provides data on the location and orientation of MgATP. Figure 5A 
illustrates the general localization of MgATP. Figure SB is a close-up of difference density 
enclosing an oriented model of AMP. 
0 Figure 6 is an overall two dimensional topology diagram for the C-subunit. of cAPK. 

Figure 7 provides stereo views of selected conserved areas. 

Figure 8 illustrates the conserved catalytic core of c-AMP dependent protein kinase. 
Figure 8A is a space-fining model of the catalytic core. Figure 8B is a diagram of the 
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conserved catalytic core using the RIBBON program of the PAP package. Figure 8C is a 
space-filling model identical to A, but includes PKI(5-24). 

Figure 9 diagrams the conformation of bound PKI (5-24). 

Figure 10 illustrates the high affinity binding site interactions between the catalytic 
5 subunit and the inhibitor peptide. In Figure 10A the P-ll Phe ring (numbered 356) is shown 

in relation to three nearby hydrophobic residues of the C-subunit. Figure 10B illustrates the 
interactions of the P-ll Phe and P-6 Arg sidechains with the C-subunit. 

Figure 11 summarizes the interactions that contribute to recognition of the common 
consensus region of the inhibitor peptide. 
10 Figure 12 illustrates the consensus recognition site binding interactions. Figure 12A 

is an illustration of the electron density corresponding to the anionic P-3 site. Figure 12B 
illustrates the electron density of the P-2 Arg side chain. Figure 12C illustrates the electron 
density of the P+ 1 lie sidechain. 

Figure 13 illustrates the catalytic site area. Figure 13A provides the site of catalysis 
15 together with the probable catalytic base sidechain of Asp 166 near the p-C of the P Ala. 

Figure 13B diagrams the consensus recognition site residues Arg-Arg-Asn-Aia-Ile. Figure 
13C shows the phosphate of Thr 197 buried among sidechains of cationic and H-bonding 
residues. 

Figure 14 is a schematic illustrating the relationship of invariant amino acids at the 

2 0 active site. 

Figure 15 is a schematic of the conserved and variable regions of the catalytic 

subunit. 

Figure 16 illustrates the amino acids present in PKI(5-24) that provide important 
interactions with cAPK. 

25 Figure 17 provides a list of the coordinates that define the three-dimensional 

template. 

Figure 18 provides photographs of the crystal forms. 

Summary of the Invention 
In accordance with one aspect of the present invention, there is provided a method 

3 0 of designing a highly specific affector which exerts an effect on the activity of a first enzyme. 

In this method, the first enzyme is a member of a class of enzymes having a conserved 
catalytic core. The method comprises the following steps: identifying a second enzyme that 
is a member of the class in which a first affector can affect the activity of the second 
enzyme, forming a first complex of the first affector and the second enzyme, obtaining data 



<WO 9302209At> 



WO 93/02209 ^ ^ PCT/US92/06137 

regarding the conformation of the second enzyme at sites greater than 5 A from the site of 
catalysis of the second enzyme in the first complex, and designing an affector which induces 
a conformation on the first en2yme at sites greater than 5 A from the site of catalysis 
thereof which is homologous to the conformation of the second enzyme at homologous sites 
5 in the first complex, when the affector is formed as a second complex with the first enzyme. 

Preferably, this method additionally comprises crystallizing the first complex and obtaining 
X-ray crystallography data therefrom. In a preferred form of this method, all of the 
members of the class have related functions, and the catalytic cores of all of the members 
of the class have conserved amino acid residues. In this form of the method, preferably the 
10 designing step comprises designing an affector having homologous topography and charge 

fields that complement the catalytic core of the first enzyme and capable of inducing a 
conformation wherein the conserved amino acid residues of the first enzyme are in 
homologous locarions to the second enzyme in the first complex- The affectors can be 
inhibitors, activators or other affectors of enzyme activity. The first affector can be all or 
15 a portion of the first enzyme, and the first complex can be a holoenzyme. The class of 

enzymes can comprise protein kinases or any other suitable class. The second enzyme can 
be a viral oncogene product or a cellular homologue thereof, such as p60 v-Src from RSV 
or its cellular homologue, pp60 c-src. The second enzyme can also be cAMP-dependent 
protein kinase. The second enzyme can be a native mammalian protein kinase or a 
2 0 recombinant protein kinase. In a preferred form of the method, the designing step 

comprises identifying a potential affector Likely to induce a conformation on the catalytic 
core of the first enzyme which is homologous to the conformation of the catalytic core of 
the second enzyme in the first complex, when the affector is formed as a second complex 
with the first enzyme, and determining whether the potential affector induces the 

2 5 conformation through the use of a technique selected from the group consisting of computer 

modelling, small angle neutron scattering, and circular dichroism. In this preferred method, 
the potential affector comprises a peptide, and the potential affector comprises a molecule 
selected from the group consisting of nucleotides, polynucleotides, alcohols, polymers, lipids 
carbohydrates, sugars, native or synthetic molecules, protein and organic compounds and 

3 0 combinations thereof. In accordance with this aspect of the invention, the method can 

include producing the affector. Thus, the present invention also includes the affector 
produced from the method. 

In another aspect of the present invention, there is provided another method of 
designing a highly specific affector which exerts an effect on the activity of a first enzyme. 
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In this method, the first enzyme is also a member of a class of enzymes having conserved 
residues at an affector binding site. This method comprises the following steps: identifying 
a second enzyme that is a member of the class in which a first affector can affect the activity 
of the second enzyme, the first affector having a dissociation constant with the second 
5 enzyme of less than 1 pM, forming a first complex of the first affector and the second 

enzyme, obtaining data regarding the conformation of the affector binding site of the second 
enzyme in the first complex, and designing an affector which induces a conformation on the 
affector binding site of the first enzyme which is homologous to the conformation of the 
affector binding site of the second enzyme in the first complex, when the affector is formed 
10 as a second complex with the first enzyme. In one form of this method, the class of 

enzymes has a nucleotide binding site and each of the affectors is capable of binding to the 
nucleotide binding site. 

In still another aspect of the present invention, there is provided another method of 
designing a highly specific affector which exerts an effect on the activity of a first enzyme. 
15 In this method, the first enzyme is also a member of a class of enzymes having a conserved 

catalytic core. This method comprises the following steps: identifying a second enzyme that 
is a member of the class in which a first affector can affect the activity of the second 
enzyme, forming a first complex of the first affector and the second enzyme, the first 
complex having at least three points of contact between the first affector and second 
2 0 enzyme, obtaining data regarding the conformation of the catalytic core of the second 

enzyme in the first complex, and designing an affector which induces a conformation on the 
catalytic core of the first enzyme which is homologous to the conformation of the catalytic 
core of the second enzyme in the first complex, when the affector is formed as a second 
complex with the first enzyme. 
15 Stiu another aspect of the present invention provides a crystallized protein 

kinase/affector complex having stable decay characteristics over 15 minutes and a 
crystallized protein kinase/affector complex having a Bragg spacing diffraction limit of less 
than 4A. Preferably, the crystallized protein kinase of this aspect of the invention exhibits 
both of these characteristics. The present invention also provides a crystallized complex of 
0 a cAMP-dependent protein kinase and an inhibitor thereof This crystallized complex can 

be used in an X-ray crystallography procedure to produce data regarding the three 
dimensional structure of the cAMP-dependent protein kinase in the complex, and this data 
can be used for designing an inhibitor of a second protein kinase which will induce a similar 
three dimensional structure on the active site of the second protein kinase as the three 
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dimensional structure of the cAMP-dependent protein kinase in the complex. Thus, the 
present invention also includes an inhibitor designed by this method* 

Another preferred method of the present invention involves preparing a highly 
specific affector of a first enzyme, with the first enzyme being a member of a class of 
5 enzymes having a conserved catalytic core with a plurality of invariant residues. This 

method comprises the following steps: identifying a second enzyme that is a member of the 
class and having a known affector thereof, forming a first complex of the second enzyme and 
the known affector, obtaining data regarding the three dimensional coordinates of the 
invariant residues in the first complex, the coordinates forming a template, generating a 
20 model wherein the first enzyme is in a conformation in which the invariant residues are in 

substantially the same conformation as in the template, identifying a change in the variable 
residues in the catalytic core of the first enzyme in the conformation of the template when 
compared to the variable residues in the catalytic core of the second enzyme in the 
conformation of the template, preparing a modified form of the second enzyme, wherein the 
15 modified second enzyme includes a non-conserved change identified through this method, 

and designing an affector which computer modelling indicates will induce a conformation 
on the catalytic core of the first enzyme in which the invariant residues are in substantially 
the same coordinates as the coordinates of the template, when the first enzyme is formed 
as a second complex with the affector designed in this step. Preferably the identified 
2 0 change is a non-conserved change in the variable residues. In a preferred form of this 

method, the method also includes forming a third complex of the modified second enzyme 
and an affector capable of binding thereto, obtaining data regarding the three dimensional 
coordinates of the invariant residues in the third complex, and using the data obtained in 
the previous step to design an affector which computer modelling indicates will induce a 
25 conformation on the catalytic core of the first enzyme in which the invariant residues are 

closer to the coordinates of the template than the conformation induced by the affector 
designed previously, when the first enzyme is formed as a fourth complex with the affector 
designed in this step. The affector used for computer modelling can be the known affector. 
Preferably, the method also includes modifying the computer modelling in light of the data 
obtained through the method prior to designing the affector. Amino acid sequence data 
relating to the catalytic cores of the first and second engines is preferably obtained. Site 
directed mutagenesis of a recombinantly produced second enzyme can be used in 
accordance with the method. In one preferred aspect of this method, the coordinates of 
the template are substantially as shown in Figure 17 and the template can include 
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coordinates separated by the distances substantially as shown in Table 4. The affectors can 
be inhibitors or other affectors. The method can also include preparing the designed 
affector. Thus, the present invention also includes the affectors prepared through this 
method, and also includes pharmaceutical compositions containing these affectors. 
5 The present invention also includes a method of designing a specific inhibitor for a 

protein kinase, comprising the following steps: obtaining data regarding the three- 
dimensional structure of a first protein kinase, and using the data in the design of an 
inhibitor for a second, different, protein kinase. The first protein kinase is preferably 
cAMP dependent protein kinase or an analogue thereof. The obtaining step preferably 
10 comprises crystallizing a complex of cAMP dependent protein kinase and an inhibitor 
thereof, and additionally includes obtaining X-ray crystallography data on the crystallized 
complex obtained from the crystallizing step. Thus, information regarding the structure of 
the crystallized complex obtained in the crystallizing step through circular dichroism, small 
angle neutron scattering, or other chemical procedures can be obtained. 
15 I n a preferred form of the present invention, there is provided the use of the data 

of Figure 17 or of Table 4 in the design of an affector for a protein kinase. 

Still another aspect of the present invention involves a method of preparing a highly 
specific inhibitor of a first enzyme. The first enzyme is a member of a class of enzymes 
having a conserved catalytic core with a plurality of invariant residues. This method 
2 0 includes the following steps: (a) identifying a second enzyme that is a member of the class 

and having a known first inhibitor thereof, (b) forming a first complex of the second enzyme 
and the first inhibitor, (c) obtaining data regarding the three dimensional coordinates of the 
invariant residues in the first complex, (d) designing a second inhibitor which computer 
modelling indicates will induce a conformation on the catalytic core of the first enzyme in 
25 which the invariant residues are in substantially the same coordinates as the coordinates 

obtained in step (c), when the first enzyme is formed as a second complex with the second 
inhibitor, (e) preparing the second inhibitor, (f) forming a third complex of the second 
inhibitor and a third enzyme complexable therewith, the third enzyme having a plurality of 
the invariant residues, (g) obtaining data regarding the three dimensional coordinates of the 
3 0 invariant residues in the third complex, and (h) using the data obtained from step (g) to 

design a third inhibitor which computer modelling indicates will induce a conformation on 
the catalytic core of the first enzyme closer to that in which the invariant residues are in 
substantial the same coordinates as the coordinates obtained in step (c) than induced by 
the second inhibitor, when the first enzyme is formed as a fourth complex with the third 
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inhibitor. This first inhibitor is in one embodiment of this method an inhibitory domain of 
the second enzyme. The third enzyme preferably contains at least 5 invariant residues, and 
can be a naturally occurring enzyme or a mutant enzyme. 

Still another aspect of the present invention involves a method of determining the 
5 efficacy of a first affector in affecting a first enzyme that is a member of a class of enzymes 

having a plurality of invariant residues among the members of the class. This method 
includes the following steps: determining the three dimensional coordinates of the invariant 
residues of a second enzyme in a first conformation wherein the second enzyme is in a 
complex with a second affector that is a strong affector of the enzyme, determining the three 
10 dimensional coordinates of the invariant residues of the second enzyme in a second 

conformation wherein the enzyme is in a conformation other than the first conformation, 
identifying the mobile invariant residues of the enzyme, the mobile invariant residues being 
those invariant residues at coordinates substantially different in the first conformation than 
in the second conformation, determining the three dimensional coordinates of the mobile 
15 invariant residues of the first enzyme when the first enzyme is in a conformation wherein 

the first enzyme is in a complex with the first affector, comparing the three dimensional 
coordinates of the mobile invariant residues of the first enzyme in the conformation with 
the coordinates of the mobile invariant residues of the enzyme in the first conformation, 
wherein relatively small differences in the coordinates indicate relatively high efficacy of the 
first affector. The step of determining the coordinates of the first enzyme in the 
conformation is preferably performed using computer modelling of the conformation. The 
steps of determining the first and second conformations preferably comprise obtaining X-ray 
crystaliographic data of the enzyme. The second conformation can be a conformation 
produced by a ternary complex, such as one comprising a protein kinase, a nucleotide and 
an affector. The second conformation can also be a conformation produced by the second 
enzyme not complexed with a Iigand, or the same enzyme as the first enzyme. 

In an additional aspect of the present invention, there is provided a method of 
designing specific inhibitors of a first protein kinase having a template defined by a plurality 
of invariant residues present in substantially all protein kinases. This method comprises the 
following steps: obtaining data regarding the three dimensional coordinates of the invariant 
residues of a second protein kinase and of the points of contact between the second protein 
kinase and a known inhibitor thereof, the coordinates being obtained when the second 
protein kinase is formed as a complex with the known inhibitor, generating a model of the 
first protein complex wherein the template is defined by the positions of the invariant 
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residues in the complex, examining the amino acid residues present in the first protein 
kinase at positions corresponding to the points of contact in the complex, and designing an 
inhibitor of the first protein kinase capable of forming ionic and hydrophobic interactions 
with the amino acid residues. The method of Claim 62, wherein the second protein kinase 
5 is cAMP dependent protein kinase. The known inhibitor can be PKI(5-24). For this known 

inhibitor, the points of contact in the complex preferably comprise positions corresponding 
to the positions selected from the group consisting of P+ 1, P-2, P-3, P-6 and P-l 1 along the 
known inhibitor. The positions corresponding to the points of contact in the examining step 
preferably comprise positions within a sphere having a radius of 11 A, more preferably 6 

10 A, from the coordinates of the point of contact obtained in the obtaining step. The 

designing step preferably additionally comprises designing the inhibitor to form appropriate 
hydrogen bonding with the amino acid residues. 

Further details concerning the present invention are provided in the following 
detailed description, 

15 Detailed Description of the Invention 

CITED REFERENCES INCORPORATED BY REFERENCE 

A number of articles are specifically cited herein as providing background 
information useful, but not essential to those of ordinary skill in the art in the practice of 
the present invention. As such, the disclosure of each of these articles is hereby explicitly 

2 0 incorporated by reference. 

INTRODUCTION 

The protein kinase family of enzymes is used as a model for this invention, These 
enzymes are involved at all levels of regulation in the eukaryotic cell. They act as 
"transistors" for the cell, receiving signals and amplifying the message inside the cell. 
25 Protein kinases receive hormone signals from outside the cell. They are involved in cell 

growth, for cellular homeostasis, and for triggering the steps of mitosis. 

In addition, many oncogenes code for protein kinases. These oncogenic protein 
kinases are also very diverse in their structure and location within the cell. However, all are 
derived from normal cellular components and all, in one way or another are defective in 

3 0 their ability to be turned off. In other words, they are constitutively active in contrast to 

their protooncogene counterparts which are turned off in the absence of the appropriate 
signal. Thus, protein kinases are not only an essential part of normal cell growth and 
division, but, can lead to oncogenesis when their normal function becomes genetically 
impaired. 
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Diversity is a hallmark of the protein kinase family. For example, growth factor 
receptors, such as the insulin receptor, are large proteins with a major extracellular domain 
for binding growth factor, a single membrane spanning domain, and an intracellular protein 
kinase domain that is activated in response to growth factor binding. The kinase activity is 
5 limited to a specific domain of the protein. Control of the insulin receptor may play an 

important role in the control of diabetes. Protein kinase C is activated by diacyl glycerol 
and Ca 2+ and is also activated by the tumor promoting phorbol esters. It is a cytoplasmic 
protein that in its active state is associated with the plasma membrane^ Another protein 
kinase, cdc2, associates with cyclin B and is an essential trigger for mitosis. The 
10 transforming protein in Rous Sarcoma Virus, pp60 v ' src is anchored to the cytoplasm surface 

of membranes. In spite of the diversity in size, subunit composition, location in the cell, and 
mechanism of activation, all protein kinases share a common enrymatic activity and a 
conserved catalytic core, indicating that all have likely evolved from a common functional 
precursor. Thus, one aspect of the present invention provides a method for developing 
15 highly selective inhibitors for members of the protein kinase family. 

The first protein kinase to be purified was phosphoryiase kinase. The second was 
phosphatase kinase kinase, later renamed cAMP-dependent protein kinase 
(EC2.7J.37:ATP:protein serine phosphotransferase) when its broader substrate specificity 
was appreciated. Not only was cAMP-dependent protein kinase (cAPK) one of the first 
2 0 protein kinases to be characterized, it also is one of the simplest and best understood 

biochemically. Its simplicity is due primarily to its mechanism of activation, which involves 
subunit dissociation. With the exception of the oncogenic enzymes, all protein kinases 
typically are maintained in an inactive state in the absence of the appropriate activating 
signal. In the case of cAPK, the ligand triggering activation is cAMP, one of the first 
1 5 recognized second messengers for hormone signalling. In the absence of cAMP, the enzyme 

is sequestered as an inactive holoenzyme containing two regulatory (R) and two catalytic 
(C) subunits. When intracellular cAMP levels are elevated, the cyclic nucleotide binds to 
the R^subunit, thus causing the complex to dissociate into a R 2 dimer and two free and 
active C-subunits. The general consensus sequence recognized by the C-subunit is Arg-Arg- 
0 X-Ser/Thr-Y, where X is any small residue and Y is a large hydrophobic group. The 

conserved catalytic core found in all protein kinases is contained within this relatively simple 
monomeric C-subunit. 

This invention provides the first crystal structure of a protein kinase with its catalytic 
subunit intact. Knowledge of the conformation of the catalytic structure of cAPK is central 
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to the understanding of protein kinase activity. Not only is the structure of the cAMP- 
dependent protein kinase catalytic site provided, but, the crystals contain a bound inhibitor 
peptide* This inhibitor peptide, PKI(5-24), is a fragment of the heat stable protein kinase 
inhibitor (PKI). This peptide includes the consensus features common to all peptide 
5 substrates and inhibitors of cAMP-dependent protein kinase. In addition, it contains other 

features that convey unique high affinity binding characteristics. Thus, precise properties 
of binding and interaction are described. From this data, a template is derived from which 
all other protein kinases can be modelled and from which other inhibitors can be designed. 
One of the more important questions regarding protein phosphorylation is how the 
10 targeted protein substrate is recognized by a specific protein kinase. This question has 

remained particularly elusive until now because the determinants for peptide recognition are 
widely dispersed and in some cases well-removed from the actual site of phosphotransfer. 
Owing to its simplicity as well as its relative ease of purification, the catalytic or C-subunit 
of cAMP-dependent protein kinase serves here as a prototype for identifying functional sites 
15 that are involved in substrate recognition and catalysis. Chemical analyses and procedures, 

such as affinity labeling, group specific labeling, and fluorescence energy transfer all have 
provided clues about regions involved in peptide recognition, MgATP binding, and catalysis. 
Substrate analogues provide indirect information about binding sites important for affector 
molecule specificity. Further NMR, circular dichroism, small angle neutron scattering 
2 0 (SANS) and other chemical procedures offer further insight into the structure of the 

enzyme. However, X-ray crystallography provides a comprehensive three dimensional 
structure that can confirm and integrate these other techniques. 

The expression of the C-subunit in E. coli not only facilitated these structural studies, 
but also has permitted recombinant approaches to be used to further modify the active site 
15 of cAPK and thereby mimic the reactive site of other protein kinases. Information to aid 

in these studies is obtained from sequence data avai'- t - 1 - for the protein kinase family. 
Hanks et al. f Science 241: 42, 1988, is one source of such data. Such sequence comparisons 
have identified highly conserved regions including several invariant residues, variable 
regions, and places where inserts and deletions can be tolerated. Both chemical and 
0 sequence information aie used here to verify the structure data obtained from the X-ray 

diffraction studies. As will be disclosed herein, this body of information permits the design 
of other affector molecules specific for other protein kinases. Further, this information 
serves as guidelines for the design of specific affector molecules for enzymes from a wide 
variety of enzyme families. 
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The existing basis for the design of specific inhibitors for protein kinases, in the 
absence of the three dimensional structure provided herein, relies on the use of synthetic 
peptides based primarily on the sequences of known substrates and inhibitors. In the case 
of cAMP-dependent protein kinase, there are some very specific high affinity peptides 
5 available. Existing inhibitors also include nucleotide and nucleoside derived compounds 

found through traditional means. However, these nucleoside and nucleotide inhibitors do 
not generally exhibit the type of specificity observed with peptide inhibitors. In general, 
such specific peptide inhibitors are not available for other protein kinases. Specificity for 
cAMP-dependent kinase improves with the addition of amino acids postulated to lie outside 
10 of the catalytic core. We have discovered that these regions are also important for inhibitor 

design. Knowledge of these sites provides a lock" to permit for the first time the tailoring 
of inhibitors for any given protein kinase. Thus, one important aspect of the invention lies 
in the design of the "lock", that requires an understanding of the three dimensional structure 
of the complex of the catalytic subunit of cAMP-dependent protein kinase, with its very 
15 potent specific inhibitor, PKI(5-24). 

Disclosed herein is a template gleaned from the crystal structure of the catalytic 
subunit of cAMP-dependent protein kinase. Just as the chemical information derived from 
the C-subunit serves as a framework for interpreting the entire kinase family, the structure 
of cAPK provides information for the creation of a template for viewing the conserved 
2 0 catalytic core of all eukaryotic protein kinases. This invention further provides a model for 

the identification and design of molecules capable of interacting with the catalytic core of 
a given enzyme by analyzing the conserved catalytic core of another member of that enzyme 
class. 

X-RAY CRYSTALLOGRAPHY 

25 X-ray crystallography permits three dimensional molecular analysis of a protein at 

the atomic level. Analysis requires the production of crystals and crystal production requires 
a pure concentrated product. Further, complexes of a protein of interest together with a 
second interacting molecule provides information on the conformational changes occurring 
within a protein in response to that second molecule. X-ray crystallography of a protein 

3 0 with its substrate, an antibody or a drug can provide information for rational drug design. 

An X-ray diffraction pattern taken from a crystal looks like an array of spots of 
varying intensities. Each spot is related to one of the Fourier coefficients of the electron 
density pattern in the crystal Thus, the electron density in the crystal can be reconstructed 
if a sufficient number of diffraction spots can be measured and the relative phase angles of 
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the Fourier coefficients can be determined. Thus, a crystallized enzyme used in the practice 
of certain aspects of the present invention should be of sufficient quality to obtain these 
measurements. For example, the spots of varying intensity in the diffraction pattern decay 
over time. It is quite difficult to work with diffraction patterns with half lives of less than 
5 10 hours. However, it is possible to work with diffraction patterns having half lives as short 

as about 15 minutes to 3 hours, depending on the amount of structural data desired to be 
obtained. Further, it is believed possible to work with crystals of even shorter half lives 
using equipment and computer programs more advanced than commonly available today. 
Additionally, not all crystals are of equal quality and poor crystals have large Bragg spacing 
10 diffraction limits. Thus, a workable crystal should have a Bragg spacing diffraction limit of 

less than 4 A. 

Determination of phase angles uses isomorphic replacement to insert atoms into 
defined positions in the crystal for diffraction data measurement. These angles provide 
information that permit the production of an electron density map. The map is then used 
15 to build an atomic model from which three-dimensional coordinates are measured that 

define the structure of the crystallized molecule. 
MODEL SYSTEM 

X-ray crystallography has been employed for the rational design of drugs and other 
interacting molecules. However, to date, the rational design of affector molecules has been 
2 0 limited to a study of the active site of the protein /affector molecule interaction. Potential 

affectors designed from this information have not been obtained by looking at interactions 
beyond the active site. We believe that these interactions assist in binding and thereby 
contribute to binding specificity. Thus, we have discovered that it is these interactions in 
concert with information obtained from the active site that make the design of specific 
2 5 affector molecules a possibility. Moreover, this information additionally permits the design 

of specific affector molecules for related but noniden' ; "*> enzymes. 

Many enzymes within a cell have evolved from common progenitors. These enzymes 
share common enzymatic activities and one example is the protein kinase family. Since the 
functions of the enzymatic families or classes are broadly conserved, at least a portion of the 
30 catalytic site is also conserved. Therefore to a large extent rational drug design relies on 

the identification of the familial similarities and hence drugs are designed to react broadly 
within a given family or class. 

While all members of an enzymatic class may provide a similar activity, such as 
phosphorylation or dephosphorylation. each member may have only one specific target. 
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Thus, successful rational drug design based on group similarities would provide molecules 
that also interact broadly. Where there are many members of an errzymatic class acting 
specifically within a restricted locale such as a single cell, a broadly acting drug would 
interact with any number of enzymes from the group. Thus, the interaction would be 
5 general and not specific. In contrast, the present invention advantageously is capable of 

providing affectors with highly specific interactions for a given member of an enzyme class. 

Previous methods for rational drug design require the crystallization of the target 
molecule of interest. However, the production of useful crystals is both difficult and time 
consuming. It first depends on the ability of the target molecule to be isolated and purified 
10 in sufficient quantity for crystallization- A large number of crystallization conditions often 

need to be tested and once a crystal is made that is of sufficient quality, additional crystals 
often need to be produced in order to have enough material for analysis. Further, not all 
molecules are readily purified or readily crystallized. Advantageously, the present invention 
discloses a method whereby only one enzyme within a family of enzymes need be 
15 crystallized. 

This invention teaches a method for the identification and design of specific 
molecules interacting with a specific enzyme wherein the specific enzyme is a member of 
a broadly acting enzymatic group or class. 

The particular enzyme class chosen for this invention is preferably one that has the 
2 0 characteristics generally associated with an enzyme class developed from a divergent 

evolutionary pathway. That is, an enzyme class in which it is possible to identify similarities 
within the catalytic core of all members of the class. Enzymes with similar activities that 
have evolved from convergent evolution will not necessarily share these constant residues 
and a model or template employing invariant amino acids as anchors would then not be 
2 5 possible. A variety of enzyme families are postulated to arise from divergent evolution, and 

thus would be expected to serve as a preferred class of emymes for design of affector 
molecules within the context of the present invention. Such enzyme families include, but 
are not limited to, the protein kinases, phosphoryiases, and several groups of proteases. 

For purposes of illustration only, the present invention is described using the protein 
kinase family as a model system. As discussed above, these enzymes are essential for many 
aspects of cell regulation. Over 100 individual protein kinases have been identified. Thus, 
the successful design of affectors to manipulate the activity of a kinase can provides an 
invaluable tool for research as well as for the design of a wide variety of therapeutics and 
diagnostics. 
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Thus, for example, development of effective specific inhibitors of oncogenic kinases 
is believed to lead to the development of antineoplastic treatments. In addition, specific 
inhibitors of kinases involved in hormone regulation will be useful in artificially regulating 
the secretion and regulation of such hormones. Also, since many neurotransmitters are 
5 regulated by kinases, development of new affectors could potentially impact on diseases of 

the nervous system. Further, platelet aggregation and clot formation might also be regulated 
through novel affectors of kinases developed through the methods of the present invention. 
Many other therapeutics are believed possible through the development of novel specific 
affector molecules. 

10 The model system used in connection with this invention uses cAMP-dependent 

protein kinase together with a 20 amino acid inhibitor peptide, PKI(5-24), to establish a 
"lock" for specific affector design. This inhibitor is unique in that it interacts only with the 
cAMP-dependent protein kinase. Therefore, cocrystallization of this inhibitor with cAMP- 
dependent protein kinase permits the visualization of the conformation of an enzyme in 
15 association with its specific inhibitor. 

The general requirements for peptide recognition by the catalytic subunit were 
characterized originally using synthetic peptides. From those studies evolved a general 
consensus sequence that includes two basic residues, typically arginine, followed by one 
intervening small residue preceding the site of phosphotransfer. The phosphate acceptor 
2 0 site, either Ser or Thr, is followed immediately by a large hydrophobic residue. The typical 

peptide substrate used for this kinase, based on the phosphorylation site in pyruvate kinase, 
is shown in 1. Replacement of the Ser with an Ala generates an inhibitor peptide with poor 
affinity. In contrast is the inhibitor peptide, PKI(5-24), derived from the thermostable 
protein kinase inhibitor (PKI). This peptide, seen in 1, encompasses the consensus 
25 sequences described above but also contains additional unique features that convey high 

affinity binding. The particular features that contribute to high affinity binding were defined 
using peptide analogues, and the most relevant ones are also indicated in 1. A general 
structure of the peptide in solution was deduced using circular dichroism (CD) and NMR 
spectroscopy. The peptide, PKI(5-24), was co-crystallized with the catalytic subunit of cAPK, 
JO and the structure of that peptide as well as its interaction with the protein are discussed 

here. 

The folding of the polypeptide chain and the mechanism of catalysis is conserved in 
all protein kinases. There are 8-9 invariant residues scattered throughout the core for all 
protein kinases. The crystal structure reveals that most of these invariant residues are 



BNSDOCID. <WO 9302209A1> 



WO 93/02209 



o 



-16- 

clustered three dimensionally around the site of catalysis providing an interconnected 
network. The regions involved in peptide recognition extend over a wide area on the 
surface of the enzyme and until this structure was solved there was no understanding of the 
details of the peptide recognition sites. The structure of the catalytic subunit thus serves 
5 as a framework from which a template for the entire protein kinases family can be 

produced. This structure provides, for the first time, a true molecular basis for the design 
of affectors that will selectively target any given protein kinase. Thus, it is an object of this 
invention to provide a method for the identification and design of molecules interacting with 
the catalytic core of a protein kinase by preparing a template from the analysis of the 
10 catalytic core of the cAMP dependent protein kinase. 

The ability to design affector molecules that act on a given enzyme using information 
obtained by X-ray crystallography is dependent on the formation of crystals of purified 
enzyme. Methods for crystal production vary greatly and one cannot predict how readily 
a given molecule or complex will crystalize. However, those skilled in the art will recognize 
15 that a variety of methods for crystallizing can be attempted for any given enzyme, and that 

successful crystallization can be expected of a variety of enzymes. Rational drug design 
additionally requires information about the interaction of a known affector in order to 
accurately predict a potential affector's effect on the catalytic core of the enzyme. Thus, 
crystals of the complex of affector molecule and enzyme together are used to gather 
2 0 information on the conformation of the enzyme in its inhibited conformation. Thus, in 

addition to information about the catalytic core of an enzyme family and the identification 
of additional sites adjacent to the core that permit the specific design of inhibitors, the 
present invention provides an improved method for the crystallization of complexes. 

An important feature of certain aspects of this invention is the production of an 
15 enzyme/affector template. In order to generate this template, the affector chosen for 

production of enzyme/affector complex should have a high affinity for a particular enzyme. 
The initial affector molecule chosen should preferably have a Kd less than 1 fiM, and more 
preferably less than 100 nM, in order to provide a conformation resulting from high affinity 
interactions. Once the specific interactions are understood it is contemplated that affector 
0 molecules having a variety of Kd ranges could be selectively designed for various purposes. 

Thus, in the model system chosen to illustrate this invention, PKI(5-24) is used as an 
affector of cAPK, with a Kd of approximately 60 nM. Those of ordinary skill in the art will 
recognize that other inhibitors with Kd less than 100 nM could also have been chosen to 
illustrate this aspect of the present invention. 
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For example, cGMP dependent protein kinase has an inhibitor with a Ki of 
approximately 6 nM, and an inhibitor for cAPK described by Ricouart et al. is characterized 
in the 4 nM range (J. Med. Chem., 34: 73-78, 1991). These Ki values are roughly equivalent, 
however, not identical, to the expected dissociation constants (Kd's). 
5 The PKI(5-24) inhibitor peptide used here is highly specific and is rather large in 

that it extends beyond the catalytic core. Other known peptides and affector molecules for 
kinases are not as specific. We have discovered that the interactions beyond the catalytic 
core provide the high specificity of PKI(5-24) for cAPK. 

As stated above, it is the crystallization of the enzyme with its specific inhibitor 
10 together with the analysis of the relationship of the inhibitor to both the catalytic core and 

to areas surrounding the core that provide data for the particular protein kinase "lock". The 
"lock" comprises the three dimensional structure and ionic, hydrophobic, hydrogen bonding 
and other interactions of the non-conserved variable residues with the specific affector 
structure. The lock is defined by the invariant residues of the exemplary structure. When 
15 an enzyme of the class is affected by a specific affector, the backbone atoms of these 

invariant residues must be in substantially the same relative coordinates in all members of 
the enzyme class. Thus, with knowledge of sequence information of the particular enzyme 
for which the affector is being designed, knowledge of the lock formed by the invariant 
residues can be obtained. The coordinates of the invariant residues position the variable 
2 0 residues of the lock in space and thereby permits the design of other specific inhibitors and 

affector molecules for other protein kinases. 

The lock consists of the site of phosphotransfer (P site) with recognition sites for 
flanking sequences. The flanking sites can be identified by the number of amino acid 
residues separating that site from the P site. Thus, the first amino acid residue in the 
direction moving toward the carboxy terminus is designated P+ 1, and the following residues 
are designated P+2, P+3, P+4 and so on. Simila the residues on the side moving 
toward the amino terminus are designated P-l, P-2 and so on. 

The sites for recognition of the peptide are not identical between members of the 
protein kinase family, and the chemical content is unique for each given protein kinase. The 
sequence of the given protein kinase is built into the coordinates of the C-subunit using the 
invariant residues. The position of these invariant residues can be identified using X-ray 
crystallographic data, such as the data disclosed herein in Figure 17. This data provides the 
coordinates for each non-hydrogen atom in cAPK. It is the locations of these invariant 
residues which serve to define the template common to all protein kinases. This template 
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can then be used to model the three dimensional coordinates of the variable as a basis to 
design highly specific affector molecules. 

The affector molecules to be designed could be polypeptides, nucleic acids and their 
analogues, combinations of nucleotides and peptides, organics or any other molecule capable 
5 of specific interaction with a given enzyme. The essence of the design of a specific inhibitor 

for a given protein kinase is based on the three-dimensional fit of the specific inhibitor into 
the provided "lock", or template, provided by the known structure of cAPfL 

The template defined by the invariant or other highly conserved residues can be used 
to define the region immediately flanking the phosphorylation site and, in addition, can 
10 incorporate more distant parts of the molecule to enhance specificity and affinity. The 

peptide recognition site serves in the same manner as the antigen recognition site of an 
antibody. This site extends over a large surface of the enzyme and provides a unique lock 
for the design of a wide variety of affector molecules, including both peptide and non- 
peptide affectors. Each particular protein kinase has a different and unique chemical 
15 content at each individual site. Thus, the "lock", is unique for each protein kinase. 

The "lock" of any particular en2yme represents a topological map with defined sites, 
positions of which vary between members of the enzyme class. As an analogy, each kinase 
can be thought of as functioning in a manner similar to a specific antibody in that it 
recognizes only a very specific set of proteins to phosphorylate. However, each of the 
2 0 kinases has a conserved template, the positions of which will not substantially vary between 

kinases in an inhibited conformation. Thus, using computer modelling together with known 
sequence information regarding a particular kinase, the invariant residues of the kinase can 
be placed in the template conformation, and the approximate positions of the variable 
residues can be predicted. 
- 5 The lock provides the information from which other specific affector molecules can 

be designed. It provides information on topology, charge interactions and the points of 
contact both within the catalytic core and around the core that suggest the design features 
important for the production or identification of novel affector molecules. Thus, the goal 
is to design an affector having homologous topography and charge fields that complement 
0 the catalytic core of the lock of the enzyme. Computer modelling can be used with these 

factors to design an affector capable of inducing a conformation where the conserved amino 
acid residues of the enzyme are in homologous locations to the template. 

BRIEF DESCRI PTION OF AFFECTOR PI STON 
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The basic steps toward achieving this invention are provided briefly here and in 
detail below. A class of enzymes is first identified wherein at least one enzyme of the class 
has a highly specific affector molecule. Then the inhibitor is tested for specificity and, 
preferably, the inhibitor sequence is reduced in size until a minimum sequence having the 
5 desired specificity is obtained. Sequence data from related enzymes is analyzed so that a 

consensus region that forms the catalytic core can be identified. Crystals of affector 
molecule together with the model enzyme are subjected to multiple isomorphous 
replacement techniques to prepare heavy atom derivatives. This permits the location of 
heavy atoms within the structure to be identified and additionally permits multiple 
10 diffraction patterns to be combined to deduce phase angles for calculation of the electron 

density of the structure. Those of ordinary skill in the art will recognize that other 
techniques can be used to deduce phase angles and to improve the accuracy of previously 
deduced phase angles. 

A three-dimensional structure can be obtained from the electron density data using 
15 a computer program such as TOM/FRODO. Further, a computer program, such as X- 

PLOR, can be used to improve the accuracy of the initial three-dimensional structure. 
There are a variety of computer programs available for analyzing X-ray crystallographic 
data. Those used in the development of the model system for this invention are cited herein. 
Those of ordinary skill in the art will recognize that many other such computer programs 
2 0 providing similar functions could also have been used. From this data, the points of contact 

are identified both within the catalytic core and the surrounding region. Invariant amino 
acids and consensus recognition sequences are identified. The data is further analyzed 
against available chemical data such as NMR, CD, SANS data and other data resulting from 
chemical procedures. This chemical data can provide additional information for the 

2 5 structural model. 

The coordinates of the invariant amino acids residing in the conserved catalytic core 
and the surrounding invariant residues in the enzyme/affector complex provide the template 
to be duplicated in other members of the enzyme class. The lock of the enzyme for which 
the affector is to be designed is then built by replacing the variable amino acids of the 

3 0 catalytic subunit of the template enzyme with the amino acids of the new enzyme. Any gaps 

in the sequence alignment between the enzyme used to generate the template and the 
enzyme for which the lock is being modelled generally occur within loops. These loop 
regions can be modelled separately using the structural data accumulated in a data bank, 
such as the Brookhaven data bank. The model of protein kinase can then be corrected and 
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refined using an energy minimization procedure and using molecular dynamics to eliminate 
stearic and electrostatic clashes. The resulting model of the catalytic core of the protein 
kinase under investigation is then inspected for amino acid content of the enzyme's surface 
which interacts with the proposed inhibitor. 

In accordance with one aspect of the present invention, another member of this 
enzymatic class can then be analyzed in the context of this template. If the new enzyme can 
be crystallized, then the information obtained from the crystallization is merged with the 
H lock" structure. However if the new enzyme is not accessible or is not crystallizable, the 
enzyme can still be incorporated into the three-dimensional lock. The ability to incorporate 
the new enzyme into the template is dependent on the identification of conserved residues 
within the catalytic core of the new enzyme that are complementary to the conserved 
residues in the model enzyme. The template establishes the coordinates for these residues 
in three-dimensional space as well as providing coordinates for the three-dimensional 
surface of the catalytic core and adjacent regions. 

For cAPK and the protein kinase family, the invariant residues are identified and 
summarized in the review by Hanks et al., supra . The template permits a comparison of the 
new enzyme catalytic core surface with cAPK. Residues within the catalytic core that are 
different from those of cAPK are studied to determine how those differences in the new 
enzyme might alter the surface of the core or change the structure of a new affector 
molecule. Recombinant cAPK can then be subjected to site-directed mutagenesis to change 
residues specific to cAPK into residues found in the new enzyme. This recombinant protein 
can be crystallized. 

A novel affector molecule can then be synthesized that complements the electrostatic 
charges and topography of both the catalytic core and identified surrounding regions of 
interest for the new enzyme. The points of contact, hydrophobic pockets, site of 
phosphotransfer, topography and stearic interactions are assessed and the affector molecule 
can then, if necessary, be subjected to random mutagenesis or site-directed mutagenesis to 
improve the affector/enzyme interaction. This model affector molecule together with 
recombinant mutated cAPK, are tested with the target enzyme for affector activity. The 
affector molecule is finally tested with the native new enzyme. Fluorescent tags bound to 
the affector can be used to assess binding to the new enzyme in the cell. Alterations in 
enzyme function can be detected by gel electrophoresis and complexes of enzyme and 
affector can be isolated and purified for further analysis. Thus, new enzyme purification and 
crystallization is not required for affector design. 
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As discussed above, generation of new affectors is not limited to peptides. A variety 
of chemically synthesizable compounds can be used. 

The model can be tested by a variety of methods. For example, kinetic 
determination of inhibition constants of novel inhibitors can be measured. Also, CD, SANS 
5 and other chemical procedures can be used to assess the extent of the conformation changes 

due to binding of the affector. If a mutated form of the enzyme has been prepared, 
cocrystallization of the affector with this mutated form can be performed and the points of 
contact can be determined and compared with the modelled points of contact. 
ENZYME FAMILY CHOICE AND IDENTIFICATION OF CATALYTIC CORE 
0 This invention relates particularly to enzyme families formed by divergent evolution. 

Once an enzyme family of interest is identified, an individual enzyme is chosen from a group 
of enzymes that share invariant residues within their postulated active sites. 

The enzymatic or active site within a given protein kinase can be broadly identified 
through biochemical means. When the enzyme exists as a group of subunits, enzymatic 
activity is often restricted to one of those subunits. Thus, prior to performing these 
biochemical means, the enzymatic subunit can be purified from the holoenzyme. The active 
site can be further localized by systematically reducing the subunit size and assessing enzyme 
activity with each reduction. In one method, the various mRNA sequences encoding the 
related enzymes are reversely transcribed and cloned. Sequence information can then be 
obtained from the catalytic region for a number of enzymes of the same class. Similar 
amino acid residues within the catalytic subunit are aligned in order to visualize homologous 
regions. Invariant amino residues can be identified among the class which are either present 
in all known members of the class or substantially all members of the class. At least a 
plurality of these invariant residues are believed necessary for enzyme activity within the 
catalytic subunit. Thus, the invariant residues can further define the catalytic core. 

For the protein kinase family, invariant amino acid residues are located within the 
catalytic core and are boxed in by a solid line in Figure 1. Figure 1 illustrates the general 
topological arrangement of the catalytic subunits from a number of protein kinases. 

Figure 2 illustrates that while the catalytic regions from members of the protein 
kinase family share some striking similarities, the placement of this active region within the 
enzyme, the size of the enzyme and the regulatory regions of the enzyme vary considerably. 
The conserved catalytic core is denoted in Figure 2 as solid black areas and regulatory 
regions are cross-hatched. Additional information regarding the use of cAMP-dependent 
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Protein Kinase as a model for the protein kinase family can be found in a review by S. 
Taylor (J. Biol. Chem. 264:8443-8446, 1989.). 

It is known from an analysis of the catalytic core of the protein kinase family that 
the core is included in a conserved 300-residue segment. Site-directed mutagenesis of 
recombinant enzyme sequences has been used to identify particular residues critical to 
enzymatic function. In the cAMP-dependent kinase an invariant lysine residue at position 
72 has been shown to be important by site-directed mutagenesis and a triad of glycines is 
thought to be associated with ATP-binding. 

The enzyme exists as a tetrameric holoenzyme composed of a dimer of regulatory 
subunits and two catalytic subunits. cAMP binds to the regulatory dimer yielding 
dissociation of the enzyme into an R2(cAMP) 4 complex and two active catalytic (C) 
subunits. It is the active C-subunit that phosphorylates serine or threonine residues on 
substrates having the consensus sequence Arg-Arg-X-S er /Thr-Leu . 
AFFECTOR MOLECULE FOR TEMPLATE DEVELOPMENT 

In a preferred form of the present invention, the enzyme used to establish the 
template or lock is, advantageously, a molecule that binds with high affinity to its affector, 
preferably with a dissociation constant less than 1 \xM. For example, there are many such 
known affectors, such as inhibitors and activators, of various protein kinases. Kinases with 
a regulatory subunit are known that are inhibited by a peptide encoding the regulatory 
subunit binding site. Similarly, kinases that possess an autoinhibitory portion are also 
know. Thus, for such a kinase, this autoinhibitory region could be cleaved away from the 
core enzyme, purified and analyzed to provide a minimal high-affinity inhibitory sequence. 

There are several inhibitors of cAMP-dependent kinase. The regulatory subunits can 
function as physiologic inhibitors as can the heat stable inhibitor protein (PKI). These 
inhibitors share a substrate-like sequence based on the arginine doublet, N-terminal to the 
position of the phosphorylation site in a normal substrate. Peptide fragments containing the 
consensus sequence bind the C-subunit in a manner analogous to a real substrate. PKI has 
an alanine in place of the phosphorylatable residue. While the PKI sequence is clearly 
inhibitory, the addition of a 15 residue stretch N-terminal to PKI increases inhibitory 
activity. Thus, residues external to the catalytic site are believed to be relevant in providing 
potent, high-affinity, inhibition and for improving the specificity of an inhibitor. 

A protein that is a high-affinity inhibitor of an enzyme can be dissected to find a 
smaller fragment, if it exists, that still contains high-affinity inhibitory activity. At least three 
factors are useful in this dissection: 1) an ability to produce chemicaUy defined fragments 
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of the larger inhibitor, either by synthesizing peptides or by cleaving the inhibitor with 
reagents such as cyanogen bromide or proteases, that cut at short amino acid sequences of 
a specific type for each reagent, 2) an ability to isolate specific fragments of the larger 
inhibitor from the mixture of fragments resulting from cleavage of the larger inhibitor, and 
5 3) an ability to assay chemical species for inhibition of the enzyme of interest. 

To carry out the isolation of a potential smaller inhibitory region of a larger 
inhibitor* one can cleave the inhibitor into fragments using a protease. Then one can 
separate the resulting fragments using HPLC and assay the fractions for high-affinity 
inhibition of the target enzyme. If no fraction is found that exhibits the desired inhibition, 
10 the cleaving reagent may have cleaved at a location that splits the inhibitory portion of the 

protein, destroying its ability to inhibit. In this case, it would be desirable to obtain other 
cleavage patterns until an inhibitory fragment is found. After obtaining the smallest possible 
inhibitory fragment using proteolytic cleavage of the intact inhibitor, one can chemically 
sequence the fragment as a step toward further defining the smallest fragment still having 
15 high-affinity inhibitory activity. With knowledge of the amino acid sequence, one can then 

use peptide synthesis to construct progressively shorter subsets of this fragment. These 
shorter subsets can then be assayed for inhibitory activity. Proceeding in this manner will 
thus allow definition of the smallest sequence, present in the larger inhibitor that still 
possesses high-affinity inhibitory activity toward the target enzyme. Methods for 
2 0 determining inhibition constants for tight-binding inhibitors are found in Biochem. J. 127: 

321-333, 1972 by P. Henderson. Methods for determining the inhibitory region PKI(5-24), 
are provided by Scott et al. in Proc. Natl Acad. Sci. USA 82:4379-4383, 1985. 
OBTAINING THREE DIMENSIONAL STRUCTURE DATA 

In order to obtain data on the conformation of the template of the enzyme formed 
15 by binding of the affector thereto, a variety of techniques can be used. These techniques 

include, circular dichroism, small angle neutron scatteri diffraction methods, including any 
combination of multiple and single isomorphous replacement, single or multiwavelength 
anomalous scattering methods, molecular replacement methods maximum entropy phasing, 
solvent-flattening methods and so-called "direct" methods used primarily to solve small- 
1 0 molecule structures, however, in the preferred embodiment, X-ray crystallography is used 

in order to generate specific coordinates for each of the non-hydrogen atoms in the complex. 
Coordinates for the hydrogen atoms could additionally be obtained using neutrons. Thus, 
following the isolation of an exemplary protein and affector and following or during the 
sequence analysis of related enzymes, crystals of enzyme and affector protein are generated. 
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The crystals can be generated from enzyme purified from natural tissue or from 
enzyme generated by recombinant means. Provided below are examples pertaining to the 
production of crystals using the recombinant mouse C a -subunit of cAMP dependent protein 
kinase and purified cAMP-dependent protein kinase from porcine heart. Nelson et at 
5 describe the purification schemes for porcine heart cAMP dependent kinase (J. BioL Chem. 

256:3743, 1981.) and Slice et al. disclose the methods for the generation of recombinant 
mouse C Q -subunit in E. coli (J. BioL Chem. 264:20940, 1989). The sequence data for cAPK 
was published by Uhler et al. (J. Biol. Chem. 261:15360-15363, 1986). 

The steady state kinetics of the C-subunit, purified from E. coli are identical to the 

10 mammalian C-subunit, although the E. coli protein is more labile to heat denaturation. 

Unlike the mammalian enzyme, the recombinant C-subunit lacks a myristoyl group at its 
amino terminus. For a review of protein crystallography see Protein Crystallography. 1976, 
T. Blundell and L.N. Johnson, Academic Press, New York. Information on circular 
dichroism and neutron scattering is found in Biophysical Chemistry. Part H: Techniques for 

15 the Study of Biological Structure and Function, C.R. Cantor et al. (W.H. Freeman and Co., 

San Francisco, 1980). 

Example 1 
Porcine Heart C-subunit Crystal Forms 
Reagents were obtained from the following sources: threo-1, 4-dimercapto-2,3- 
2 0 butanediol (DTT, dithiothreitol; Aldrich, Milwaukee, WI); N,N-bis(2-hydroxyethyI)gIycine 

(Bicine; Aldrich); methanol (Fisher Scientific); ammonium acetate (Aldrich); polyethylene 
glycol (Dow, Midland, MI). 

The peptide inhibitor PKI(5-24) was synthesized at the La Jolla Cancer Research 
Foundation (La Jolla, CA) and modified in our laboratory These modifications are 
25 described in detail below. The sequences of the peptide inhibitors are: (l)PKI(5-24); 

TTYADFIASGRTGRRNAIHD, (2)PKI(5-24), tyrosine iodinated: 
TTY*ADFIASGRTGRRNAIHD. The peptide sequence abbreviations follow either of the 
two standard abbreviation schemes for amino acids; the three letter code or the single 
capital letter designation. Both are standard abbreviations and are well understood by those 
3 0 of skill in the art. 

The porcine C-subunit was purified to a single band on SDS-poiyacrylamide gels and 
used for crystallization. Two crystal forms were prepared. Photographs of the porcine heart 
apoenzyme (cubic); and the porcine heart C:MgATT:PK3(5-24) ternary complex (hexagonal) 
are provided as Figures 18A and 18B. 
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The first crystal form used the hanging-drop vapor diffusion method. A drop of 
protein mixed with precipitating agents is suspended from a microscope cover slip and 
allowed to equilibrate through the gas phase against a larger reservoir. 

Specifically, 30-uL drops of 3-4 mg/mL protein solution were suspended and allowed 
5 to equilibrate against approximately lmL of reservoir solution in wells of plastic Linbro 

tissue culture trays over a time of several days to weeks. Both new forms, as well as the 
earlier P2j form, were grown at 4°C. The porcine heart C-subunit was concentrated to 8-12 
mg/mL and subjected to a final dialysis before attempting crystallization. The specific 
recipe for obtaining the first new form was the following: drop- 1/3 protein in 50 mM 
10 (NH 4 ) 2 HP0 4 and 5 mM 2-mercaptoethanoI (pH 8.0-8.2); 1/3 150 mM NH 4 CH 3 COO, 50 

mM (NH 4 ) 2 HP0 4 , and 10 mM dithiothreitol (DTT) (pH 8.1-8.2); and 1/3 reservoir 
composed of 8-9% PEG-400, 17-20% MeOH, and 10 mM DTT. Crystals of the second form 
were obtained from the same conditions as the first new form when the drop contained, in 
addition to the protein, MgATP and a 20-residue peptide inhibitor [PKI(5-24)J in the molar 
15 ratio 20:5:1:1 ATP: Mg^PKI (5-24) :C-subunit. The same crystal form was subsequently 

grown from a drop containing 1/3 protein in 50 mM bicine, 100 mM NH 4 CH 3 COO, and 
5 mM 2-mercaptoethanol (pH 83); 1/3 MgATP and PKI(5-24) in 10 mM DTT in the same 
ratio to protein as before; and 1/3 8 mM DTT and 8% PEG-400. The reservoir contained 
8% PEG-400, 15-20% MeOH and 7mM DTT. The first new crystal form could also be 

2 0 grown in the presence of the Mg* + and the non-hydrolyzable ATP analogue adenosine 5'p, 

Y-methylenetriphosphate (AMP-PCP). The second new crystal form, representing the 
ternary complex, could be grown with CoCl 2 or CdCl 2 substituted for MgCl 2 in the 
crystallization. The transition from one crystal form to another caused only by addition of 
MgATP and the peptide inhibitor PKI(5-24) suggests that a significant conformation change 
25 may occur upon their binding. 

The space groups of the new crystal forms were determined to be P4 X 32 (cubic) 
(Figure 18A), and P6 X 22 (hexagonal) (Figure 18B), respectively. Space groups were 
determined and all diffraction data were measured at the University of California, San 
Diego Research Resource Laboratory at 4 6 C using graphite-monochromated CuK a X-rays 

3 0 from either the Mark II Elliot GX-6 rotating anode diffractometer operating at 2 kilowatts 

or the Mark in Rigaku RU-200 rotating anode diffractometer (available from Rjgaku USA, 
Danvers, MA) operating at 5 kilowatts, each equipped with two Xuong-Hamiin multhvire 
area detectors (available from San Diego Multiwire Systems, San Diego, CA). 
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Preferably, area detector data collection is used. One facility offering equipment to 
support this data collection technique is The Resource Research Laboratory. This facility 
is a geographically designated, NIH supported facility to promote the use of X-ray 
crystallographic techniques. In speed, signal-to-noise ratio, and data precision, area detector 
5 data collection far surpasses standard diffractometer or film data collection. On the 

average, data collection is 50 times faster; consequently complete high resolution data sets 
can frequently be collected from a single crystal in one or two days. The space groups and 
lattice constraints of the crystal forms were determined to be the following: P4j32, a=b=c= 
169.24 A; and P6j22, a=b=80.3 A, c =293.0 A. Calculations using an average reciprocal 
1 0 density of 2.7 A 3 /D yield to the nearest unit 2 and 1 C-subunit monomers/asymmetric unit. 

The P4 2 32 form diffracts typically to 3.2 A. Pictures of the cubic and hexagonal crystal forms 
can be seen in Figure 18 

Because the P6 2 22 crystal form had diffraction better in extent and decay 
characteristics than the P4!32 form and because of the greater biochemical interest of a 
L5 ternary complex, work concentrated on solving the hexagonal ternary complex crystal form. 

The Jack of phase angles for a similar protein structure prohibited an initial structure 
solution for the C-subunit in the P6 2 22 form using molecular replacement techniques, so a 
structure solution using standard multiple isomorphous replacement (MIR) techniques was 
attempted. Both of these are techniques known to those of ordinary skill in this art. Briefly 
0 MIR involves introduction into the space group asymmetric unit of a relatively heavy 

reference atom that, after being located through difference Patterson analysis, enables the 
needed phase angles to be determined. The reference atom can be found with 6-A data, and 
with its location and generation of phase angles the fundamental crystallographic problem 
of a protein structure solution is solved and an electron density map can be calculated. 
Subsequent work on a protein structure focuses on incrementally improving the degree of 
detail visible in the electron density map through acquisition of higher resolution data and 
accompanying phase angles. 

The procedure used to search for heavy-atom derivatives was to soak or co- 
crystallize C-subunit with heavy atoms based on the empirical success record or various 
heavy atom reagents and on known C-subunit chemical information, such as the availability 
of two free thiol groups and the obligatory use of a divalent cation in catalysis. Soaks in Au, 
Hg and Pt compounds yielded precession picture diffraction changes but uninterpretable 
6A difference Patterson maps. A 4.8-A data set from a Na 2 U 2 0 7 soak yielded an apparent 
Patterson solution through examination of isomorphous difference and (1/variance)- 
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weighted anomalous difference Patterson maps, but the site quality was not high and 
attempts to reproduce or improve the soak failed. Isomorphous crystals grown with Co 2+ 
or Cd 2+ substituted for Mg 2+ proved useless since neither metal could be located (location 
of Co 24 * through Patterson analysis was improbable anyway due to its lightness), although 
5 with phases their positions could reveal metal site number and location. Co 2+ and Cd 2+ 

were chosen for co-crystallization based on their reported ability to support nucleotide 
binding to the C-subunit and support catalysis, although at a reduced rate. 

The single most important modification in the crystallization protocol that led to the 
formation of crystals in a different space group was the careful selection of polyethylene 
10 glycol in combination with various low molecular-weight alcohols. Commercially available 

polyethylene glycol contains various contaminants that may cause problems in the 
achievement of stable and reproducible crystallization conditions. All commercially available 
polyethylene glycols (PEG) were examined with the aim of detecting the presence of ionic 
species. 

15 The lowest level of ionic contaminants was detected in PEG manufactured by Dow 

Chemical. It is this PEG that was selected for further crystallization experiments. PEG from 
other sources appeared to be generally more contaminated and also exhibited large 
differences in contamination between batches. In our experiments, several molecular 
weights of PEG were used along with several low-molecular-weight alcohols. 

2 0 The catalytic subunit crystallized in the hexagonal space group with the introduction 

of PKi(5-24) and MgATP, whereas in its apo form it crystallized in the cubic space group 
using otherwise identical crystallization conditions, indicates that the hexagonal crystal may 
arise as a result of a different conformational state of the enzyme. 

Example 2 

25 Mouse recombinant C-subunit Crystal Forms 

One of the most promising directions for combining crystallographic methods with 
those of molecular biology is the development of highly effective vectors for expressing large 
amounts of protein for crystallization. Expression of protein in E. coli also provides a 
mechanism for eliminating posttranslational modifications which may hinder crystallization 

JO and in addition permits structure-function studies on mutant forms of the protein following 

the generation of mutant containing crystals. 

The recombinant murine catalytic subunit, whose expression and purification was 
described by Slice et al M is devoid of myristic acid at the N terminus and differs by nine 
amino acids from the porcine heart catalytic subunit used in the earlier crystallizations. It 
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has been shown, however, that N-terminal myristoylation is not necessary for C-subunit 
function. Additional differences between the porcine heart and recombinant mouse C a 
proteins include the presence of additional phosphorylation sites (Ser 10 and Ser 139) on 
the recombinant protein. 

5 Crystals were prepared from a binary complex of the recombinant mouse C a -subunit 

with a bound, high-affinity (Kj = 3 nM) inhibitor peptide. The peptide (PKI95-24) derived 
from the N-terminal region of the naturally occurring thermostable protein kinase inhibitor 
protein (PKI) t is the same peptide inhibitor used for the porcine heart ternary complex 
crystal. The steady state kinetics of the C-subunit purified from E.coli, are indistinguishable 
10 from those of the mammalian C-subunit, although the E. coli protein is more labile to heat 

denaturation. 

The recombinant protein was crystallized using a small variation of the porcine heart 
ternary complex (hexagonal) conditions. A photograph of an exemplary crystal is provided 
in Figure ISC, First a ternary complex was prepared with MgATP and PKI(5-24). A 
C:PKI(5-24) binary complex was obtained after small-angle neutron scattering experiments 
showed that for the recombinant mouse C-subunit, the PKI(5-24) peptide alone, without 
MgATP, was able to cause a significant decrease in the radius of gyration. The ternary 
complex crystal form diffracted to at least 2.7 A on the Mark III and was of orthorhombic 
space P2 1 2 1 2 1 . A data collection strategy following the procedure of Xuong, et al., (Acta 
Crvst. B4I: 267, 1985 was developed. Equipment for use with this procedure is available 
from San Diego MuJtiwire Systems of San Diego, California. The procedure allowed an 
asymmetric unit of data to be collected in 3 o-sweeps totaling about 140° (with appropriate 
choice of 4> and ^settings and a crystal mounted with one of the axes parallel to the capillarv 
axis). Data collection took about 16 h for a >90% complete 2,7-A data set from one crystal 
with R sym on the ^tensity of 4-6%; in the same period the average reflection intensity 
decayed approximately 15%. 

Crystals were generally soaked or mounted in a stabilizing solution prepared as the 
crystallization drop, but with the addition of the initial reservoir MeOH percentage and the 
omission of C-subunit and PKI(5-24). It was discovered that Cd 2+ could be substituted for 
Mg + - in crystal growth, as with the porcine heart ternary complex crystal It was also 
discovered that elevating the MgCl 2 to ten times the starting mother liquor concentration, 
after crystal growth had stopped, altered the cell dimensions slightly (<1%) and resulted in 
a different pattern of heavy-atom binding. 

Example 3 
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Recombinant Binary-Complex Structure Solution 
The binary complex crystal was nearly isomorphous with the ternary complex crystal, 
differing by less than 1% along any axis and had the same space group with a =73.62 A, 
b= 76.53 A, c* 80.14 A. The asymmetric unit contains one C:PKI(5-24) complex and has a 
5 calculated solvent content of 0.53. Mercury reagents were co-crystallized with the 

C:PKI(5-24) complex by exposing it to 1-mM reagent for six hours, followed by dialysis to 
remove excess Hg reagent. Native and co-crystallized 4-(hydroxymercuri)benzoic acid 
(PHMB) derivative diffraction data were measured on the Mark III diffractometer. Data 
from two native crystals were merged to provide a more complete data set for the initial 

10 phase computation, phase extension work, and initial refinement cycles, but later refinement 

used data from only a single crystal that yielded better quality data. Data from two PHMB 
co-crystals, one with measured Bijvoet mates, were kept separate and used to compute 
initial phases, which were improved at constant 3.5-A resolution using the solvent flattening 
approach of Wang (Methods Enzymol. 115:90, 1985), with a slightly conservative solvent 

15 content of 0.50. Additional phases from inversion of the solvent-flattened map were added 

incrementally using 6 additional envelopes to a final resolution of 2.7 A. Starting with 
minimap a-carbon coordinates, the program TOM/FRODO (available from Christian 
Cambillau, University of Marseille, Marseilles, France) was used with the resulting map to 
model in the sequence of PKI(5-24) and 257 of the 350 residues in the C-subunit. This 

2 0 partial model was refined using X-PLOR (available from Axel T. Brunger, Yale University, 

New Haven CT), and calculated structure factors were combined with Wang structure 
factors in a resolution-dependent manner using equally weighted ABCD coefficients to yield 
improved maps. ABCD coefficients are described by Hendrickson et al. (Acta Cryst. B26: 
136, 1970). Structure solution statistics are summarized in 2, and a sample of electron 

2 5 density of the structure determination is shown in Figure 3, described in detail below, 

A number of crystal forms of the catalytic subunit of cAPK have been obtained thus 
far. All of the crystal forms of the different complexes of the catalytic subunit, with the 
exception of the monoclinic crystal of the apoenzyme, were obtained under identical 
crystallization conditions and these are described above. The crystals in different space 

3 0 groups therefore very likely result from conformational states of the enzyme. Crystals of 

both the binary and ternary complexes with PKI(5-24) exhibited better diffraction 
characteristics than crystals of the apoenzyme. 

Our results also indicate that the ternary complex of the murine catalytic subunit 
expressed in E. coli produced a crystal of better quality than did the ternary complex of the 
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catalytic subunit purified from porcine heart. It is difficult to conclude whether this was due 
to the absence of myristic acid, the amino acid differences between the two forms, 
microheterogeneity in the mammalian enzyme, or a combination of these factors. It may 
suggest, however, that another way to improve the quality of crystals is to mutate the protein 
5 and to cocrystallize mutants if crystallization of the wild type fails. 

Three factors are important for reproducible crystallization. First, the salt of the 
eluting buffer of the last column must be chosen carefully. Second, the purity of the protein 
must be verified with isoelectric focusing gels. The protein must not contain typical additives, 
such as glycerol and should not be frozen prior to crystallization. Third, all reagents used 
1 0 for crystallization must be of the highest degree of purity. If all of these conditions are met, 

it is possible to obtain, in identical crystallizations, three different crystal forms representing 
two different conformational states of the enzyme. Some of those crystals, such as those of 
the ternary complex with PKI(5-24), are of much better quality than the other crystals. 

The current model consists of C-subunit residues 15-350 and 1-20 of PKI(5-24) and 
has been partially refined using X-PLOR to an R-factor of 0.195 with r.m.s.(root mean 
square) bond length deviation from ideality of 0.024 A. The location of the MgATP-binding 
site was determined by difference Fourier synthesis with the nearly isomorphous ternary 
complex crystal, which showed clear density for the adenine, ribose, and «-P0 4 for the low- 
[Mr + ] ternary complex crystal. The high-fMg 2 *] difference density showed additional 
features that could contain the p- and y-P0 4 as well as metal ion(s), but an unambiguous 
assignment of atoms to this density could not be made. 

Diffraction data is summarized in Table 1. Definitions for Table 1 are as foliows:f fa , 
calculated heavy-atom structure factor amplitude; F p . measured native structure factor 
amplitude; F ph . measured derivative structure factor amplitude; AF^, calculated Bijvoet 
2 5 difference; E is0 . r.m.s. isomorphous lack-of-closure, E^, r.m.s. anomalous lack-of-closure- 

R c = S|Fph^pKhl1IFph-F p |. 

All diffraction data were measured at 4°C using graphite-monochromated CuK a X- 
rays from the Mark HI Rigaku RU-200 rotating anode diffractometer equipped withfcvo 
Xuong-Hamlin multiwire area detectors. Paired runs starting from settings (o,4>, X ) and 
(<M>+ 180,-x) were used to collect Bijvoet mates (inverse beam method). Data reduction 
and derivative-to-native scaling were done using the UCSD area detector data processing 
programs (available from San Diego Multiwire Systems). R^-EU^ - I avg |£l and 
is shown for merged Friedel pairs. 
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Native- 1 was used for native. Hg positions of the PHMB (4- 
(hydrox7rnercuri)ben2oic acid) co-ciystal derivative were found from a difference Patterson 
synthesis. The heavy-atom sites in relation to the model suggest heavy-atom binding at Cys 
343 (major site) and Met 58 (minor site). Positional and relative occupancy refinement of 
5 two common sites (relative occupancies 2.66, 1.87 for PHMB-1), and calculation of native 

phases and corresponding ABCD coefficients, were done using the program HEAVY 
(available from the Protein Data Bank, Brookhaven National Laboratoiy, Upton, NY). 
Solvent flattening used the Wang program package (Bi-Cheng Wang, University of 
Pittsburgh, Pittsburgh, PA) on imported initial ABCD coefficients and phases to 3.5 A. 
10 Molecular envelopes were calculated with solvent content 0.50 rather than 0.53 calculated 

for the cell. After 3 envelopes at 3.5 A, the resolution was extended incrementally in 6 shells 
to a final resolution of 2.7 A. After convergence at 3.5 A, the mean phase change/reflection 
was 36.6° and the mean figure of merit was 0.84; the map inversion R-factor was 0.181. 
Phase extension added 6786 phases from 5914 in the 3.5-A starting set; 261 unobserved 
15 reflections were estimated by map inversion in the 2.7-A set. 

X-PLOR Version 2.1 was used exclusively following recommended protocols 
provided in the accompanying manual. Simulated annealing was performed according to a 
slow-cooling protocol (Brunger et al. Science 235:458*460, 1987) between either 3000K or 
4000K and 300K, followed by 120 cycles of conjugate-gradient minimization. Refinement 
2 0 began with the partial model of Stage A to improve the coordinates for phase combination. 

Combined maps were calculated using the Hendrickson-Lattman scheme. Wang phases 
were used to 6 A combined ones between 6 A and 3.5 A or 3.0 A, and calculated phases 
between 3.5 or 3.0 A and 2.7 A. The corresponding weighted amplitudes were m Wang Fo, 
m comb( 2Fo - Fc )* and m sim( 2F °- Fc )- The mod el was completed by iterative refinement and 
building in areas not included in refinement of partial model. Refinement and R-factor ( = 
IIV^I^o) calculations used F/o >2 reflections (12024 Native- 1 reflections; 10194 
Native-2 reflections beginning with Stage B). The current R-factor of 0,195 is for 2939 
atoms (no solvent atoms) with individual B-factors (r.m.s. B= 17.6 A 2 ). R.MS, bond length 
and angle deviation from ideality are 0.024 A and 4.3°. 

In summary, the crystals were grown as described above using a 5-10% molar excess 
of PKI(5-24) and were determined to be of space group P2 1 2 1 2 1 with a*73.62 A, b=7652 
A, c-80.14 A. The asymmetric unit contained one C:PKI(5-24) complex and had a ■■ 
calculated solvent content of 0.53. Native and co-crystallized 4-hydroxymercuri)benzoic add 
(PHMB) derivative diffraction data were measured using Xuong-Hamlin area detectors. 
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Data from two native crystals were merged to provide a more complete data set for the 
initial phase computation, phase extension work, and initial refinement cycles, but later 
refinement used data from only a single crystal of better quality. Data from two PHMB co- 
crystals, one with measured Bijvoet mates, were kept separate and used to computer initial 
5 phases, which were improved at constant 3.5-A resolution using the solvent flattening 

approach of Wang with a slightly conservative solvent content of 0.50. Additional phases 
from inversion of the solvent-flattened map were added incrementally using 6 additional 
envelopes to a final resolution of 2.7 A. Starting with a minima alpha-carbon coordinates, 
the program TOM/FRODO was used with the resulting map to model in the sequence of 
10 PKI(5-24) and 257 of the 350 residues in the C-subunit. This partial model was refined 

using X-PLOR, and calculated structure factors were combined with Wang structure factors 
in a resolution-dependent manner, using equally weighted ABCD coefficients to yield 
improved maps. Details on the combination can be found in Allured et al. Proc. Natl. 
Acad. Sci. USA 83:1320 t 1986 and Remington et al. J. Mol. Biol. 158:111, 1982. An example 
15 of the electron density of the structure determination is shown in Figure 3. 

Figure 3 is a stereo view of the electron density for the structure determination. 
Portions of the latest refined model of 3 p-strands are shown (top to bottom from left): 112- 
106, 114-121, 75-69. Figure 3A provides the 1.5-o experimental density calculated to 2.7A 
using phases after Wang improvement and extensions. Figure 3B provides the 1.5-o(2F c -F c ) 
2 0 density calculated with 10 to 2.7-A refined model phases. The current model consists of O 

subunit residues 15-350 and 1-20 of PKI(5-24) and has been partially refined using X-PLOR 
to an R-factor of 0.-195 with r.nus. bond length deviation from ideality of 0.024A. The 
structure of the catalytic subunit and affector molecule are described below. 

Example 4 

25 Structural Analysis of the Catalytic Subunit 

A stereo view of the backbone structure of the C-subunit with the bound peptide is 
shown in Figure 4. Residues 15-350 of the C-subunit and the twenty residues of PKI(5-24), 
in bold print, of the partially refined model are shown. The overall dimensions of the 
monomer (65A x 45A x 45A) indicate a slightly elongated molecule. Earlier hydrodynamic 
measurements showing a Stokes radius of 26. lA, a frictional coefficient ration (f/f Q ) of 1.19, 
and a radius of gyration of 20A are consistent with this structure. The most striking feature 
of the overall molecular architecture is its bilobal shape with a deep cleft between the two 
lobes. The core of the small lobe is associated primarily with the amino-terminus, while the 
core of the large lobe corresponds to the C-tenninai region of the protein. The deft 
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between the lobes is filled by a portion of the bound inhibitor peptide in the binary complex. 
A difference Fourier map of the ternary complex containing both peptide and MgATP 
places MgATP at the base of that cleft (Figure 5). The 3.5-a positive density contours for 
the (F t ernary" F binary) difference Fourier were calculated using refined model phases in 10 
5 to 2.7-A range and are shown superimposed on the partially refined backbone model. 
Figure 5A illustrates the general localization of MgATP while Figure 5B is a close-up of 
difference density enclosing an oriented model of AMP. 

The cleft is clearly the site of catalysis, and the peptide-induced conformational 
changes, observed by both SANS and circular dichroism, may be associated with a closing 
10 of this cleft. SANS established that in the absence of inhibitor and MgATP the enzyme 

adopted a more expanded conformation than that adopted by the enzyme in the binary 
complex of the enzyme and the peptide inhibitor, or the ternary complex of the enzyme, 
inhibitor and MgATP. This technique was also used to show that binding of the inhibitor 
to the enzyme did not require MgATP. Neutron scattering, in particular, established that 
1 5 the apo form of the enzyme adopts a more expanded conformation than the ternary complex 

containing MgATP and PKI(5-24). Furthermore, PKI(5-24) alone, but not MgATP, was 
sufficient to induce this conformational change. SANS and CD are techniques known to 
those of ordinary skill in this art. Accordingly, no further descriptions of these techniques 
are necessary. 

20 Most of the predictions of secondary structure made prior to this crystallographic 

study of the C-subunit are quite inaccurate and do not correlate well with the actual 
structure that is provided herein. The prediction of the secondary structure by Benner et 
al.. Adv. Enzyme Regulat. 31:121, 1991, is somewhat more accurate. It is based on chemical 
information and homologies within the protein kinase family and is accurate within the small 
25 lobe. However, detailed and accurate information on the structures of the protein kinase 

family has not been available until the discoveries presented herein. 

The amino-terminus of the C-subunit begins with an amphipathic chelix that lies 
primarily along the surface of the larger lobe. This N-terminal region differs in the 
recombinant and mammalian enzymes, since the recombinant protein lacks a myristoyl 
group at the N-terminal glycine. In the crystal structure, the first 14 amino acids are not 
visible. However, the surface of the enzyme in this N-terminal region is hydrophobic, 
suggesting a possible site for the N-terminal myristoyl moiety of the mammalian enzyme. 
The myristoyl group stabilizes the C-subunit but does not promote association with 
membranes. 
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The smaller lobe, consisting of residues 40 through 125, is associated primarily with 
the binding of the nucleotide and is characterized by a dominance of p structure. Five 
antiparallel p -strands comprise the core of this domain. The only helical element in the 
small lobe is inserted between p-strands 3 and 4 and lies on one side of the plane of the p- 
5 sheet. It consists of two parts: a two turn helix B, followed by a sharp break and a five turn 

helix, helix C. Based on a difference Fourier map (Fig. 5) with a ternary complex of the 
recombinant C-subunit containing MgATP and PKI(5-24), and supported by chemical 
evidence discussed below, it is clear that this small lobe is the primary site for interaction 
with MgATP. As seen in Figure 5, the density based on the difference map is consistent 
10 with the adenine moiety of the nucleotide oriented towards the base of the cleft beneath the 

p-sheet, with the phosphates facing outwards, towards the edge of the cleft. This structure 
is distinct from the Rossmann fold that is characteristic of many nucleotide binding proteins. 

The larger lobe, in contrast, is remarkable for its predominance of helical structure. 
Seven helices are found in this C-terminal domain. A particularly unusual feature are the 
15 antiparallel hydrophobic helices, helix E (residues 140 through 159) and especially helix F 

(residues 218 through 233), that extend right through the core of this domain. The only 
region of P-structure in this lobe is located on the surface of the cleft at the interface 
between the two lobes where four antiparallel p-strands form a sheet. Most of the regions 
important for peptide recognition, as well as some conserved residues likely to be involved 
2 0 in catalysis, are located within this larger lobe. 

The C-terminal 70 amino acids, residues 281 through 350, extend over a large 
portion of the surface of the enzyme from the bottom of the large lobe to the top of the 
small lobe. The part of this extended chain that passes through the region linking the two 
lobes appears to participate in recognition of both the peptide and the nucleotide, even 
25 though these amino acids are outside the conserved catalytic core. The other extended 

chain connecting the two lobes of the enzyme, residues 120 through 127, likewise, passes 
through this linker region between the small and large lobe and also participates in peptide 
recognition. Hence, this linking region consisting of both chains may contribute in part to 
the observed peptide-induced conformational changes described earlier. An overall two 
i 0 dimensional topology diagram for the C-subunit of cAPK is presented in Figure 6. Residues 

corresponding to the secondaiy structure elements are as follows: p-strands - 1:43-48, 2:57- 
63, 3:67-75, 4:106-111, 5:115.120, 6:1610164, 7:171-175, 8:178-183, 9:188-191; a-helices - 
A:15-31, B:76-82, C-84-97, D:-128-135, E:140-159. F:218-233, G-244-252, H:263-272, 1:288- 
293. J:301-307. 
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CORRELATION OF STRUCTURAL DATA WITH CHEMICAL DATA 

As discussed above, chemical data can be used to confirm the correct interpretation 
of the electron density map. Chemical analysis has been used as a way to obtain structural 
data in the absence of X-ray crystallography. Since the protein kinase family is an 
5 enzymatic group of major import, a significant body of chemical data is available. While this 

data cannot be used to predict a three-dimensional structure for affector modelling, it does 
provide a body of data that can be used to confirm and ensure the consistency of the three- 
dimensional structure. Thus, once a crystal structure is obtained for a model enzyme and 
its affector, the chemical data present in the literature can be used to examine the 
10 consistency of the model before proceeding to the design step. The three dimensional 

structure of the enzyme-affector complex should provide a solid explanation for the earlier 
chemical data. Information provided from chemical data together with structural data is 
used to obtain the both the template and the "lock" derived therefrom. 

For example, evidence for localizing the nucleotide binding site near the amino- 
15 terminus first came from affinity labeling with an analogue of MgATP, fluorosulfonyl 

benzoyl adenosine (FSBA). Labeling with a hydrophobic carbodiimide, DCCD, identified 
two carboxyl groups near the MgATP binding site, Aspl84 and Glu91, and, furthermore, 
established that Asp 184 could be readily cross-linked to Lys72 in the apoenzyme. The 
structure of the binary complex without bound MgATP (Fig. 7) confirms that all three 
20 residues are localized in close proximity to one another, while the difference Fourier map 

with the ternary complex places these residues close to the y-phosphate region of MgATP 
(see Fig. 5). Figure 7 provides stereo views of selected conserved areas. 1.5-o (2F Q -F C ) 
electron density (10 to 2.7A) is shown superimposed on the latest refined coordinates. In 
Figure 7A the sidechains of the invariant Lys72, Glu91, and Aspl84 are shown in proximity 
1 5 to each other. Figure 7B shows the catalytic loop, Arg-Asp-Leu-Lys-Pro-Glu-Asn (165-171), 

together with part of PKI(5-24). Arg 20 of PKI(5-?4\ is labeled as 365. Lys72 is on 0- 
strand 3, and Glu91 lies along the edge of the C-helix that faces the cleft. Aspl84 is located 
on the loop connecting p-strands 8 and 9, and this loop also lines the cleft. All three 
residues are invariant in every protein kinase. Therefore these residues can be used as 
0 anchors for modeling the three dimensional structure of other protein kinases. 

The MgATP binding site was defined more globally by differential labeling with 
acetic anhydride. By describing the reactivity of each lysine side-chain in the presence and 
absence of substrates, it was shown by Buechler et aL, Biochemistry 28:3018-3024 (1989), 
that the specific protection afforded by MgATP was localized exclusively to residues in the 
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small lobe. In addition, to Lys72, MgATP protected Lys76 and Lys47 against modification 
by acetic anhydride. These protected lysines also flank the conserved glycine-rich loop that 
lies between p -strands 1 and 2. Based on the difference Fourier shown in Figure 5, this 
loop is close to the phosphates of MgATP. 
5 Chemical studies using an affinity analogue have shown that Cys 199 is important 

for peptide binding. Modification of Cys 199 leads to loss of activity, and MgATP protects 
against inactivation. In contrast, Cys 343 can be covalendy modified with no concomitant 
loss of activity. The structural analysis reported here indicates that Cys 199 is on the surface 
of the cleft that interacts with the C-terminus of the inhibitor peptide, and Cys343 is on the 
1 0 surface of the small lobe. This distance measured between the two a -carbons of Cysl99 and 

Cys343 in the crystal structure is 24A. Thus, some of the chemical data is confirmed by the 
crystal structure. 

CONSERVED REGIONS AND THEIR FUNCTIONS 

The fact that all known protein kinases share a conserved catalytic core that is 
15 homologous to the C-subunit provides information that independently highlights important 

regions. This conserved catalytic core begins with the 0-1 strand in the small lobe and 
extends through Arg280 in the large lobe (Hanks et ah, supra t The two lobes comprising this 
conserved catalytic core can be seen clearly in Figure 8. Figure 8A is a space-filling model 
of the catalytic core (residues 40-280) shared by all protein kinases. The small lobe 
2 0 corresponding to the nucleotide binding fold 1 (residues (40-126); the larger lobe 2 (residues 

127-280). In this model the bound peptide is not shown. Figure 8B is a diagram of the 
conserved catalytic core using the RIBBON program of the PAP package (J. P. Priestle, J. 
Appl. Cryst. 21:572, 1988 and available from the Molecular Simulation Laboratory at the 
University of Minnesota, Minneapolis, MN). Regions of the linear sequence noted by 
25 Hanks et al., supra, are indicated. The protein kinase having the largest insert at each 

position is designated using the following notation to define each insert: Gene/Protein 
Name: NH 2 -terminal C-subunit residue no. (insert length) COOH-terminal C-subunit 
residue no. The inserts are CDC7:64( 14)65, KIN 1:83(26)84, PKC-y:98(6)99. c- 
mos:113(5)114, PDGFR: 137(99)138, CDC7: 196(82)197, ran + 1 :210(23)211. 
30 HSVK:240(11)241, CDC7:260(93)261, 71ess:178(7)179. Figure 8C is identical to Figure 8A,' 

but includes PKI(5-24) 3. Within this conserved core are nine invariant amino acids, as well 
as several highly conserved residues. Most of these conserved residues contribute directly 
to either MgATP binding or catalysis. Others, such as Axg280 and Asp208, exist as ion-pairs 
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and link two segments of the polypeptide chain that are widely separated in the linear 
sequence. 

In addition, to providing information on conserved residues, sequence comparisons 
among protein kinases also identify inserts, sometimes quite sizable, that lie within the 
5 catalytic core. These inserts were noted by Hanks et al., supra , but their conformation in 

the overall structure of the catalytic subunit and their relationship to other regions of the 
catalytic core is described for the first time here. The locations of these inserts are 
indicated in Figure 8. All inserts invariably are located at loops on the surface of the 
protein and can be accommodated within the tertiary structure. 
1° The structures of several important regions of the catalytic subunit are described 

below. Two highly conserved loops, as well as a triad of invariant charged residues, appear 
to be particularly important for nucleotide binding and catalysis. However the regions 
important for recognition of the peptide substrate are quite variable and were not available 
until the crystallized structure was analyzed. Predictions based on these variabilities are 
15 heretofore undescribed. 

GLYCINE-RICH LOOP 

The glycine-rich segment, Gly^-Thr-Gly^-Ser-Phe-Gly 55 , was identified originally 
as part of the MgATP binding site based on its proximity to Lys72 and on differential 
labeling with acetic anhydride, since all of the iysines flanking this region, Lys47, Lys72, and 
2 0 Lys76, are protected in the presence of MgATP. The specific structural explanation 

obtained from crystalJographic data for the protection of Lys47 is due to ionic pairing with 
the side chain of Glu333 while Lys76 ion-pairs with Glu346. Thus, conformation changes 
that occur around the glycine-rich loop as a consequence of MgATP and peptide binding 
are understood from the structural data in combination with known chemical data. 
2 5 A glycine-rich motif is associated with many nucleotide binding sites, and this region 

has been the subject of much speculation and mode 1 fading. The Rossmann fold, found 
in many nucleotide binding sites, contains a sheet of mostly parallel p-strands containing a 
glycine-rich loop. A similar motif containing a glycine-rich loop is found in other proteins 
such as adenylate kinase and p21 ras. The protein kinase fold found in the Osubunit and 
J 0 conserved in over one hundred protein kinases, does not conform to either of these motifs; 

it forms a unique nucleotide binding site. The uniqueness of this site is summarized as 
follows: (1) The glycine-rich segment lies at a sharp turn that joins two antiparallel strands 
at the beginning of the (J-sheet. (2) The phosphate binding site is not dominated by a helix 
whose dipole points towards the phosphate. (3) The nucleotide does not lie along the edge 
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of the p -sheet. (4) An invariant Lys does not immediately follow this loop. Instead, the 
invariant Lys in the protein kinases, Lys72> is located in the 0-3 strand and is a part of the 
stable scaffold of the structure. The single conserved element in each of these motifs is the 
glycine-rich loop whose apparent function is to serve as a phosphate anchor so that the y- 
5 P0 4 is poised for transfer. 

CATALYTIC LOOP 

Another highly conserved loop in the C-subunit extends from Argl65 through 
Asnl71 and can be termed the catalytic loop (Fig. 7B). This catalytic loop, Arg-Asp-Leu- 
Lys-Pro-GIu-Asn, contains 2 invariant residues, Asp 166 and Asnl71, and 2 highly conserved 
10 residues, Argl65 and Leul67. While the purpose of the glycine loop is to anchor the 

phosphate moiety and, in particular, to help position the y*P0 4 so that it is poised for 
transfer, it is the catalytic loop that appears to be the central hub that communications to 
many different parts of the molecule. This loop not only directs the catalytic event, but also 
guides the peptide into its proper orientation so that catalysis can occur. The loop itself 
15 and, in particular, the residues that are important for catalysis are highly conserved, while 

the parts of the loop that direct the peptide binding are not. 

Asp 166 is one of 4 invariant carboxyl groups in the protein kinase family. It is the 
only one that is oriented towards the Ala side chain at the pseudo-phosphorylation site in 
the bound inhibitor peptide. Aspl66 most likely functions as a catalytic base. Catalysis is 
> 0 thought to occur as a direct in-line transfer without an enzyme bound phospho-intermediate. 

INTERDQMATN CONTACTS : 

The triad composed of the side chains of Lys72, Aspl84, and Glu91, shown in Figure 
7A, is conserved in every protein kinase and is close to the y-P0 4 of MgATP. Asp 184 was 
a candidate for the catalytic base; however, the structure indicates that a more plausible role 
5 is participation in the chelation of Mg 2 * in the MgATP complex. The side chain of Asp 

184 also comes within 4-5A of the side chain of Asnl71. This cluster, Aspl84 f Asnl71, and 
Asp 166, thus forms a second triad of invariant amino acids. Asp 184, being a component of 
both triads, has the potential to shutde between the two conserved loops, the glycine-rich 
loop in the small lobe and the catalytic loop in the larger lobe. Hence, if the position of 
0 Asp 184 changes following the binding of MgATP, as it probably will given its location in the 

structure relative to the MgATP binding site, the consequences will have a direct impact on 
both conserved loops. If, for example, Asp 184 participates in the chelation of Mg +2 , its 
negative charge would be sequestered from the catalytic loop, thus allowing the other 
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residues to rearrange in order to maximize the nucleophiiicity of the serine hydroxyl moiety 
that is poised to receive the phosphate from ATP. 

This is the first protein kinase structure to be reported. The protein kinases 
represent a large family of over 100 enzymes that includes growth factor receptors as well 
5 as many oncoproteins. In spite of the tremendous diversity of these enzymes, all share a 

conserved catalytic core that retains the same essential features of secondary and tertiary 
structure and the same general mechanism of catalysis. The essential hallmarks of this 
conserved core include; (1) two lobes with a cleft between that is occupied by the substrates, 
(2) a unique nucleotide binding fold dominated by P-structure, (3) a largely helical domain 

10 associated with peptide binding and catalysis, (4) two fi-sheets converging at the active site 

near the domain interface, and (5) two conserved loops, one in each lobe, that converge at 
the active site. In marked contrast to these conserved features shared by all protein kinases, 
recognition of the peptide by the catalytic subunit involves non-conserved amino acids, and 
the peptide binding sites extend over diverse and widely separated regions on the surface 

15 of the enzyme. The detailed structure of the bound inhibitor peptide and its specific 

interactions with the catalytic subunit are described below. 

Affector binding site data may incorporate information derived from several 
experimental avenues. In addition, to crystallographic studies, substrate analogues provide 
insights into the specific features of a given substrate that are important for recognition. 

2 0 Chemical approaches such as affinity labeling and group specific labeling can identify regions 

and specific residues that are in close proximity to substrates. Crystallographic studies can 
include a structural analysis of the apoenzyme, i.e. the structure of the enzyme without other 
associated molecules. However, more importantly, crystallographic studies of co-crystals of 
the enzyme with bound substrates or affectors are provided, so that the precise features of 
25 the active site can be defined. 

Thus, in the model system of the present invention, crystals of the cAMP-dependent 
protein kinase C-subunit/PKI(5-24) were obtained and structural data derived therefrom. 
This structure of the catalytic subunit is presented as example 4. The inhibitor peptide 
PKI(5-24) is a fragment of the heat stable protein kinase inhibitor. Additional information 

3 0 about this inhibitor can be found in a publication by H.-C. Cheng et aL (Biochem J. 231:655- 

661, 1986). This peptide includes the consensus features common to all peptide substrates 
and inhibitors of cAMP-dependent protein kinase. In addition, it contains other features 
that convey unique high affinity binding. The crystals of complexed enzyme and inhibitor 
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provide insight into the guidelines necessary for designing affector molecules for other 
protein kinases. 

The general requirements for peptide recognition by the catalytic subunit were 
characterized originally using synthetic peptides. These studies found a consensus sequence 
5 that includes two basic residues, typically arginines, followed by one intervening small 

residue preceding the site of phosphotransfer. The phosphate acceptor site, either Ser or 
Thr ( is followed immediately by a large hydrophobic residue. The typical peptide substrate 
used for this kinase, based on the phosphorylation site in pyruvate kinase, is shown in Table 
2, Replacement of the Ser with an Ala generates an inhibitor peptide with poor affinity. 
10 In contrast is the inhibitor peptide, PKI(5-24), derived from the thermostable protein kinase 

inhibitor (PKI). This peptide, seen in 2, encompasses the consensus sequences described 
above but also contains additional unique features that convey high affinity binding. The 
particular features that contribute to high affinity binding were defined using peptide 
analogues, and the most important ones are also indicated in 2. This peptide, PKI(5-24), 
15 was co-crystallized with the catalytic subunit, and the structure of that peptide as well as its 

interaction with the protein are discussed below. 

A schematic of substrate and inhibitor peptides of cAMP-dependent protein kinase 
are provided ih Table 2. The nomenclature used for the peptides designates the 
phosphorylation site or pseudophosphorylatioh site residue as P. In the case of substrates, 
2 0 P will be Ser or Thr; in the case of PKI(5-24), P is Ala. The residues flanking this site are 

designated as P+ 1, P-l, etc. as indicated. This nomenclature provides a common frame of 
reference for all peptide substrates and inhibitors and can be invoked readily for every 
protein kinase. 

The Ser peptide is based on the in vivo phosphorylation site in pyruvate kinase. 
25 Residues shown to be important for peptide recognition are shaded and were identified 

using synthetic peptide analogues of the Ser peptide and of PKI. Procedures for 
determining which residues are important for peptide recognition using peptide analogues 
can be found in articles by Glass et al. and Kemp et al. (J. Biol. Chem. 262:8802-8810, 1989 
and J. BioL Chem. 252:4888-4894, 1977 respectively.) 
30 Example 5 

Conformation Determination of the Bound Inhibitor 

The conformation of bound PKI(5-24) is shown in Figure 9. Backbone C and N 
atoms are shown in bold Residues particularly important for binding are labelled according 
to the nomenclature of 2. The amino-terminus extending from the P-16 Thr through theP-8 
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Ala forms an amphipathic a -helix. This helix is followed by a turn flanked by glycines at the 
P-7 and P-4 positions. The glycines may be important for accommodating the turn or for 
providing flexibility to facilitate binding of the Arg that follows each Gly, The remainder 
of the peptide is in an extended conformation, and the density corresponding to the region 
5 at the C-terminus, the P+2 Asp and the P+3 His, is not well defined. 

The catalytic subunit itself consists of 2 lobes - a smaller lobe, associated primarily 
with MgATP binding, and a larger lobe. Nearly all of the features necessary for peptide 
recognition are found within the larger lobe, although the specific residues involved are 
widely dispersed both in the linear sequence and on the surface of the enzyme. The 
10 extended portion of the peptide that includes the consensus region for recognition of all 

substrates and inhibitors lies along the surface of the cleft corresponding to the larger lobe. 
The helical segment of the peptide is amphipathic, and its hydrophobic side lies in a 
hydrophobic pocket on the surface of the large lobe. The specific interactions of the peptide 
with the protein can be described by (i) the interactions that account for the unique highly 
15 affinity binding of PKI and (ii) by the features of the protein that are important for 

recognizing the consensus sequence common to both the inhibitors and substrate. 
HIGH AFFINITY BINDING STTF 

Based on the crystal structure, the high affinity binding attributed to the N-terminus 
of PKI(5-24) is dominated by hydrophobic interactions involving primarily the phenylalanine 
2 0 side chain at the P-ll position. Glass et al. showed that a replacement of this Phe with an 

Ala caused a 100-fold increase in Kj while replacement with l'-napthylalanine, a residue that 
is considerably larger and more hydrophobic than Phe, actually decreased the Kj by 4-fold. 
Figure 10 illustrates the high affinity binding site interactions. A hydrophobic pocket on the 
surface of the C-subunit nicely complements the hydrophobic face of the helix in the 
1 5 inhibitor peptide. This hydrophobic pocket is lined by residues 235 through 239, Tyr-Pro- 

Pro-Phe-Phe f with the phenyl ring in the inhibitor p«ntide sandwiched between the side 
chains of Tyr235 and Phe239. In Figure 10A the P-ll Phe ring (numbered 356) is shown in 
relation to three nearby hydrophobic residues of the C-subunit: Tyr 235, Pro 236 t and Phe 
239. Figure 10B illustrates the interactions of the P-l 1 Phe and P-6 Arg sidechains with the 
10 C-subunit. Distances between charged-residue sidechain atoms <3.5A apart are indicated 

by thin connecting lines. Based on the structure, the Tyr at the P-14 position is not essential 
for this hydrophobic interaction. In addition, to the hydrophobic interactions associated with 
the helix, the orientation of the high affinity binding region PKI(5-24) is fixed by the ionic 
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contacts involving the P-6 Arg. Two nitrogens in the guanidine side chain of this Arg 
undergo ion-pairing with the two oxygens of the carboxyl group of Glu203. 
CONSENSUS RECOGNITION SITE 

Figure 11 summarizes the interactions that contribute to recognition of the common 
5 consensus region of the inhibitor peptide. The interactions of the P-3 and P-2 Arg residues 

and the P-f 1 lie residue with C-subunit residues are shown. Lines are drawn between 
charged-residue sidechain atoms <3.5A apart. The P+l lie sidechain projects into the 
hydrophobic area formed by Leu 198 f Pro 202, and Leu 205. Electrostatic interactions 
dominate the portion of the peptide proximal to the site of phosphotransfer, while 
10 hydrophobic interactions dominate the C-terminal region distal to the phosphotransfer site. 

Two important requirements for peptide recognition by cAPK are basic residues at 
the P-3 and P-2 positions. Others have shown that replacing either Arg in the Ser peptide 
substrate leads to a 16-400-fold increase in f^, even when the Arg is replaced with a Lys. 
The environment flanking the P-3 and P-2 arginines explains these results since each Arg 
15 interacts with more than one carboxyl side chain. 

Table 3 provides a listing of the amino acid residues present at the various points 
of contact between PKI(5-24) and two protein kinases, cAPK and casein kinase II (CKII). 
It can be seen from Figure 11 and Table 3 that in the C subunit of cAPK, that those 
residues lining the p + 1 site are very hydrophobic and provide a pocket for the hydrophobic 
2 0 p-f l residues. In CKII, the residues lining this pocket are all basic or positively charged. 

This basic pocket compliments an acidic residue at the p+ 1 position and this is consistent 
with the known specificity of CKII, i.e. CKII prefers acidic groups at the p+1 position. 

Figure 12 provides information on the consensus recognition site binding 
interactions. The electron density corresponding to the anionic P-3 site is shown in Figure 
25 12A. Residue numbers 361, 364, 365, and 368 correspond respectively to PKI(5-24) P-6, 

P-3, P-2, and P + 1 residues. The electron density of the P-3 Arg sidechain tip is shown in 
proximity to Thr 51 carbonyl in the glycine-rich loop, and Glu 127 and GIu 331 sidechain 
carboxylates of the domain-linking region. In Figure 12B the l-o (2F 0 .F C ) electron density 
of the P-2 Arg sidechain is shown in proximity to sidechain carboxylates of Glu 170 of the 
3 0 catalytic loop and Glu 230: the P-6 Arg sidechain is shown near sidechain carboxylate of Glu 

203. In Figure 12C the U-o (2F D F C ) electron density of the P*I He sidechain is shown 
projecting into a hydrophobic pocket comprised of residues Leul98, Pro202, and Leu205. 
The side chain of this P-3 Arg interacts with Glu 127. The carboxyl side chain of Glu331 also 
is approximately 3A from the guanidinium nitrogens. The tip of Asp329 is approximately 
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SA away. Thus, the position of the guanidinium moiety is fixed In addition, the side chain 
of the P-3 Arg comes close to the backbone carbonyl of Thr51 in the glycine-rich loop and 
to the hydroxyls of the ribose ring. The side chain of Glu333 lies close to Lys47 in p-strand 
l t and the side chain Glu334 is approximately 3A from the hydroxyl group of Thr48. 
5 The P-2 site is also very anionic, and this Arg, likewise, interacts with more than one 

carboxyl group. As indicated in Figures 11 and 12B, the e-nitrogen forms an ion-pair with 
Glu 170, while one of the terminal nitrogens interacts with Glu230. Glu 203 also comes close 
to this guanidinium side chain; however, its interaction with the P-6 Arg is dominant. In the 
absence of an Arg at the P-6 position, Glu230 may ion-pair with the P-2 Arg. Unlike the 
10 P-3 recognition site, all of the carboxyl groups at the P-2 site are an integral part of the 

large lobe. 

DISTAL HYDROPHOBIC SITE (P+U 

Peptide analogue studies of others predicted a hydrophobic requirement at the P+ 1 
position since replacement of the Leu with Gly in PKJ(5-24) caused a 150-fold increase in 
15 Kjj. The reasons for this requirement are now clear from the structure (Figs. 11 and 12G). 

Leu 198, Pro202, and Leu205 form a hydrophobic groove that surrounds the He side chain. 
This hydrophobic region that constitutes the P+l site lies at the edge of the cleft and is 
likely important for proper orientation of the actual site of phosphotransfer at the P 
position. In the binary complex this region begins to align in an antiparallel P-like 

2 0 configuration with the carbonyl of the P+i lie coming less than 4A from the backbone 

amide of GIy200 and the carbonyl of Gly200 coming within approximately 3A of the 
backbone amide of this lie at the P+ 1 position (Fig, 13B). Substitution of a Pro for Leu 
at the P+ 1 position in the Ser peptide (2) yields an extremely poor substrate. Nevertheless, 
a depsipeptide analogue of this peptide lacking an amide proton at this P+ 1 site is still a 
25 good substrate for the catalytic subunit. 

Figure 13 illustrates the catalytic site area. Residue numbers 364 and 367 
correspond to the P-3 Arg and the P Ala. L5-o (2F D .F C ) electron density is shown in all 
cases. Figure 13A provides the site of catalysis together with the possible catalytic base 
sidechain of Asp 166 near the p-C of the P Ala. Thr 51 of the glycine-rich loop is shown 

3 0 near the P-3 Arg sidechain, and hydrophobic sidechains of residues Phe 54 (at the loop 

apex) and Phe 187 are shown near the site of phosphotransfer. The addition of a hydroxy! 
group would place the side chain of the residue at the P position close enough for a direct 
transfer of the y -phosphate for MgATP. The side chain of the P-l Asn also interacts with 
the glycine-rich loop as shown in Fig. 13A. Figure 13B diagrams the consensus recognition 
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site residues Arg-Arg-Asn-AIa-Ile together with the glycine-rich phosphate anchor loop to 
the left and residues 198-202 to the right. The term "residue" is here used interchangeably 
with amino acid. The carbonyl of Gly 200 can be seen pointing to the amide N of the P+ 1 
De. Figure 13C shows the phosphate of Thr 197 buried among sidechains of cationic and 
5 H-bonding residues (His 87, Arg 165, Lys 189, Thr 195). Cys 199 is also shown nearby. 

Thrl97, one of the two stable phosphorylation sites in this enzyme, also flanks the 
P+ 1 site. Multiple electrostatic interactions, seen in Figure 13C, hold this P0 4 in place and 
account for its resistance to removal by phosphatases. Fixing this phosphate moiety 
contributes conformational stability, not only to Thrl97 but also to the adjacent hydrophobic 
0 residues important for recognition at the P+ 1 site and for the proper orientation of the site 

of phosphotransfer. Based on the crystal structure, this anionic group appears to be 
important for the final correct assembly of the structure. 
CORRELATION WITH EXPERIMENTAL PREDICTIONS 

Several chemical approaches identified amino acid side chains that contribute to 
5 peptide recognition. Differential labeling with a water soluble carbodiimide, l-ethyl-3(3- 

dimethylaminopropyl) carbodiimide (EDC), targeted solvent-accessible carboxyl groups that 
were accessible in the free C-subunit but protected in the presence of substrate (Buechler 
et al. Biochemistry 29:1937-1943, 1990). Two regions were identified using this approach. 
Glul70 was very reactive in the absence of peptide, but fully protected in the presence of 
peptide. The other region was the cluster of carboxyl groups near the C-terminus, Asp 328 - 
Asp-Tyr-Glu-Glu-Glu-Glu 334 . As indicated in Figure 12, Glul70 interacts with the P-2 Arg 
while the cluster of carboxyl groups flanks the P-3 site. 

The crystal structure localized Cysl99 close to the peptide recognition site and to 
the y-P0 4 subsite of ATP. In the binary complex, Cys 199 does not appear to participate 
in peptide binding other than to contribute to the general hydrophobic environment around 
the P-f-1 site. 

CONFORMATIO NAL CHANGES ASSOCIATED WITH FEPTTDE- BINDING 

Substrate-induced conformational changes are associated with peptide binding to the 
catalytic subunit. Global changes in conformation, first observed using circular dichroism, 
showed both a loss of alpha-helical content and an increase in beta structure following 
peptide binding. A global ehange in shape also was observed using low angle neutron 
scattering. These results demonstrated a reduction of the radius of gyration (Rg) following 
substrate binding and furthermore established that the inhibitor peptide alone, but not 
MgATP, was sufficient to cause the reduction in Rg. The substrate-induced reduction in Rg 
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indicates that the apoenzyme corresponds to an open configuration of the protein while the 
binary and ternary complex represent a closed configuration. 

The recognition of the peptide by the catalytic subunit is believed to be a multistep 
process. The initial step, associated with a loss in a-helical structure, was induced by both 
5 the substrate and inhibitor heptapeptides shown in 2. The second step, presumably 

corresponding to the final orientation of the peptide into the correct position at the active 
site, was associated with an increase in P-structure and could only be accomplished with the 
substrate peptide, not by the Ala peptide inhibitor. This increase in p-structure is probably 
due, in part, to the P+ 1 region of the peptide interacting the protein. Understanding these 

10 substrate-induced conformational changes will eventually require a detailed comparison of 

the apoenzyme structure with binary and ternary complexes containing inhibitors and 
substrate peptides both in the presence and absence of MgATP. 

The peptide-induced conformational changes in catalytic subunit may reflect a closing 
of the cleft and probably involve the region linking the small and large lobes as well. This 

15 linker region consists of two chains: residues 123 through 127 and a highly acidic segment, 

residues 328-334. The P-3 peptide binding site is the only region of the inhibitor peptide 
that interacts directly with both of these extended chains that link the two lobes. One 
anionic group at the P-3 site is Glu 127 and the other is Glu331. Since several of the carboxy 
groups in the C-terminal linking chain also interact with portions of the nucleotide binding 

2 0 site, even in the binary complex, this P-3 residue may contribute to the substrate-induced 

conformational changes. 

CONSERVED A ND VARIABLE SITES IN PROTEIN KTNASFS 

The recognition of a protein substrate by the catalytic subunit is not unlike the 
recognition of a protein antigen by the variable domain of an immunoglobulin. The binding 
25 sites of both structures are dominated by interfacing p-sheets surrounded by loops that 

participate in recognition of the protein. The catalytic subunit also has helical regions, but 
it is the p-sheets that converge at the active site and it is the loops that play the dominant 
role in peptide recognition and catalysis. One p-sheet comes from the small lobe and the 
other from the large lobe. These two sheets are sandwiched together at the cleft. In the 
case of protein kinases, two of the loops are essential for catalysis and are highly conserved, 
unlike the immunoglobulins, whose function is only to bind antigens. 

The two essential conserved loops that assemble at the site of catalysis in the 
catalytic subunit. seen in Figures 14 and 15, are the glycine-rich loop in the small lobe and 
the catalytic loop in the large lobe. Both lie on the surface that lines the cleft between the 
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two lobes. The glycine-rich loop serves as an anchor for the phosphates of MgATP, whereas 
the catalytic loop is essential for peptide binding and catalysis. Key features of the active 
site of the catalytic subunit are shown in Figure 14. Nine of the amino acids that are nearly 
invariant in all protein kinases are indicated. Glyl86, another invariant residue, is not 
5 shown. The alpha carbons are in black, oxygens dotted, and nitrogens in horizontal 

hatching. The position of the phosphorylation site at Thrl97 is indicated by vertical 
hatching. The portion of the active site associated with the small lobe is shaded and 
includes three of the invariant amino acids, Gly 52, Lys72, and Glu91. The remaining six 
are located in the large lobe. Residues close enough for hydrogen bonding or ion pairing 
10 are indicated by a dashed line while residues within 4-5 A of one another are connected by 

a dotted line. As seen in Figure 14, seven of the nine invariant amino acids conserved in 
all protein kinases are located here, either in the loops themselves or connecting directly 
with loop residues. The single invariant glycine, Gly52, lies in the phosphate anchoring loop. 
The proposed catalytic base, Aspl66, as well as Asnl71, are in the catalytic loop. It is 
15 remarkable how thoroughly interconnected this region is with multiple ion pairs providing 

a finely tuned scaffolding for communication at the active site. 

The three invariant residues in the small lobe all participate in nucleotide binding. 
Unlike Gly52, which is part of a flexible loop, both Lys72 and Glu91 are anchored to defined 
pans of the secondary structure - Lys72 to p-strand 3 and Glu91 to the C-helix. The 
2 0 difference Fourier map shows the phosphate density near these residues, with the presumed 

Y-P0 4 density close enough to the P Ala for phosphotransfer were it a SerfThr] and 
indicates that these residues play a key role in the recognition of the phosphates of MgATP. 

In the catalytic loop the two invariant residues, Aspl66 and Asnl71, interact with 
each other. Not only are their side chains close, but, more importantly, the nitrogen in the 
25 amide side chain of Asnl71 is less than 3A from the backbone carbonyl of Aspl66. One 

additional nearly invariant residue, Asp220, contributes directly to stabilization of the 
catalytic loop. The two oxygens of this carboxylate come with hydrogen bonding distance 
of the backbone carbonyl and amide of residue 164 that immediately precedes the loop. 
The interaction of the catalytic loop with a conserved residue that lies deep within the large 
3 0 lobe fixes the loop from one side while peptide binding and interactions with the small lobe 

fix it from the opposite direction. As seen in Figure i3A and 13B. the consensus region of 
the peptide is sandwiched between the P+l site on one side and the glycine-rich loop on 
the other side. 
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Of all the invariant residues, Asp 184 is the only one that appears to communicate 
with both the small lobe and the large lobe. In the binary complex, it is most closely 
associated with Lys72, but it is also only 4-5A from Asnl71 and Aspl66 in the catalytic loop. 
Although not shown in Figure 14, Asp 184 is itself part of a tight turn with the carboxylate 
5 located within hydrogen bonding distance of the backbone amide of Glyl86, another 

invariant residue. This entire segment, Aspl66-Phe-Gly, is highly conserved in all protein 
kinases, and hydrogen bonding to stabilize the turn is probably conserved as well. Aspl84 
certainly has the potential to shuttle between the two conserved loops, and it is anticipated 
that the contacts of Aspl84 will differ somewhat in both the apoenzyme and in the ternary 
10 complex containing bound MgATP as well as peptide. If Asp 184 participates in the 

chelation of Mg 2 *, as disclosed above, then this charge will be sequestered from the 
immediate environment of the catalytic loop. Other residues close to the conserved residues 
in the catalytic loop in the binary complex are Tyrl64 and Lys 168. The Tyr 164 side chain 
is less than 3A from the side chain nitrogen of Asnl71, and the Lysl68 side chain comes 
15 close to the carboxylate of Aspl66. Either Tyr or His, another good hydrogen-bonding 

residue, is always found at position 164, so this contact can also be conserved. Any 
significant change in the position of Aspl84 will likely change the environment of the 
catalytic loop. Aspl84, as well as Asnl71 and Aspl66, have also been identified as a 
sequence motif associated with many phosphotransferases, and this may represent a 
2 0 common mechanism among protein kinases. 

The versatility and importance of the catalytic loop is highlighted not only by the 
conserved networking of essential amino acids at the active site, but also by the special ways 
in which this conserved network communicates with the variable residues that compose the 
peptide binding sites. This communication specifically involves loop residues that are not 
2 5 highly conserved. Glul70, for example, contributes directly to the anionic P-2 site, Thr201 

in the P+ 1 site, on the other hand, comes very close tr side chain of Asp 166. These two 
particular regions of contact involving the peptide binding site and the catalytic loop, Lys 168 
-Pro-Glu and Thr 201 -Pro.Glu.Tyr-Leu.Ala-Pro-Glu, contain sequences that differ 
characteristically between the kinases that transfer phosphate to Ser/TTir and those that 
30 transfer phosphate to tyrosine (Hanks et al., supra ). 

Argl65 is actually highly conserved in most protein kinases, and it connects in a 
unique way with the P+ 1 peptide binding site. Specifically, it points towards the 
phosphothreonine and helps to fix that phosphate so that the hydrophobic groove that 
follows and provides a pocket for the side chain of the P+ 1 residue is firmly positioned 
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(Figure 13 C). This is an autophosphorylation site, and it is the only phosphorylation site 
in the catalytic subunit that could conceivably result from an intramolecular 
autophosphorylation. Chemical analysis has shown that this phosphate is very resistant to 
removal by phosphatases and based on this ciystallographic data, appears to contribute to 
the final conformation stability of the enzyme. It should be emphasized as well that a 
phosphorylation site in this region of the protein is not a conserved feature of all protein 
kinases. Some kinases such as pp60 c " src , a protooncogene whose viral counterpart is found 
in Rous Sarcoma Virus, do have an autophosphorylation site nearby, but many others do 
not. Whether the catalytic loop communicates in unique ways with other 
autophosphorylation sites in other protein kinases remains to be established. 

The two invariant residues that are most distant from the active site are Asp208 and 
Arg280. These residues constitute a conserved ion pair that lies just beneath the P-f* 1 site 
and appears to stabilize a very hydrophobic region that buttresses the P+ 1 peptide binding 
site. 

Figure 15 is a schematic of the conserved and variable regions of the catalytic 
subunit. The ribbon diagram depicts the folding of the catalytic subunit. Conserved regions 
include two loops - the glycine-rich loop and the catalytic loop - and are indicated. The 
variable peptide binding sites are shown as solid areas. Invariant amino acids Gly52 T Lys72, 
Glu91, Aspl66, Asnl71, Aspl84, Glu208, Asp220, and Arg280 are indicated by a large dot 
and are numbered. Dashed lines indicate residues that are close enough to pair, while the 
dotted line extends from Argl65 to the Thrl97. Several points should be emphasized 
regarding the recognition of a peptide or protein substrate by the catalytic subunit. First 
is the number of sites and their diversity. Some of these peripheral peptide recognition sites 
are hydrophilic and highly charged; others are hydrophobic. As seen in Figure 15, most are 
found within the large lobe of the catalytic core shared by all protein kinases, but some also 
lie outside of this boundary. A second observation is that the requirements for recognition 
at the consensus site are not absolute. A comparison of in vivo phosphorylation sites reveals 
that the actual residues at each site vary somewhat as does the spacing between the 
positively charge side chains and the site of phosphotransfer. Thus, even in the consensus 
region, some variability can be tolerated. A third point is the potential for variability in 
recognition of different inhibitor, and presumably substrate, proteins that bind with a high 
affinity to the catalytic subunit. Most of the features essential for the high affinity 
recognition of PKI are apparent from this structure of the binary complex. The regulatory 
subunit, however, also binds to the C-subunit with a subnanomolar affinity in the absence 
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of cAMP. The consensus region, P-3 through P+ 1, is shared by both molecules. However, 
the R-subunit, cleaved at the P-5 position, still retains is high affinity binding for the C- 
subunit. In addition, the P-16 to P + 1 region of the R^subunit is Pro-Pro-Pro-Pro-Asn-Pro- 
Val-Val-Lys-Gly-Arg-Arg-Arg-Arg-Gly-Ala-Ile, and this certainly cannot conform to the 
5 helical motif that dominates the corresponding region of PKI(5-24). Hence, an amphipathic 

helix is not required for the high affinity binding of the regulatory subunit. Instead, the 
residues that contribute to the high affinity binding of the regulatory subunit, specifically, 
must lie beyond the P+3 position and may complement a different portion of the surface 
of the C-subunit. This variability presumably can also extend to protein substrates where 
1 0 the catalytic subunit may recognize unique sequences that lie outside the consensus site. 

Unlike the conserved residues that are invariant in all protein kinases, the sites 
involved in peptide recognition differ for each kinase. About 30% have some general 
similarities to cAPK. Others are quite different. However the template allows us to predict 
the specificity of each contact point. Figure 16 provides the sequence of PKI(5-24) and 
15 illustrates the distances between the points of contact and the catalytic site in three- 

dimensional space as measured from the template. The P site or site of catalysis is denoted 
by an arrow. Asterisks designate sites particularly important for the high affinity binding 
of PKI(5-24). Recognition sites essential for PKI binding to other substrates are denoted as 
labelled archways p+ 1, p-2, p-3, p-6, and p-11. All of the distances, with the exception of 
2 0 the p+ 1 site, are greater than 5 A. That positions 5 A or greater from the site of catalysis 

are important for inhibitor specificity have heretofore been undisclosed. 

The identification of the subsites that are important to maintaining the specificity 
of the affector molecule interaction and provide Kd less than lOOnM facilitates the design 
of other inhibitors. PKI(5-24) can be used as a scaffold for molding new inhibitors, and in 
>5 addition once the electrochemical interactions are understood from an analysis of the three 

dimensional template, other affectors that are not peotides can additionally be identified 
Thus, affectors could come from a group including but not necessarily limited to peptides, 
polypeptides, unmodified molecules existing in nature, synthetic molecules, nucleic acids, 
polymers, organics, or hydrocarbons. Molecules that exist in nature and that are known to 
1 0 interact with enzymes could be modified to produce affector molecules. Examples from this 

group include antibodies, antibiotics, protein, other enzymes, lipids, polysaccharides, 
saccharides and vitamins. Thus, inhibitors can be designed that utilize both conserved and 
nonconserved points of contact. 
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The invariant residues within the protein kinase family and specifically, cAPK, are 
used to apply the template and its coordinates to other protein kinases. There are 8-9 
invariant residues for the protein kinase family. Other families may have differing numbers 
of invariant residues. Table 4 list the invariant residues and the distances between these 
5 resides. The distances are calculated between a-carbons. The distances between residues 

52, 72 and 91 are expected to remain close to constant since these residues are all in the 
amino-terminal domain of the protein. Similarly, the distances between residues 166, 171, 
184, 186, 208 and 280 would be expected to remain constant due to their being in the 
carboxy-terminal domain. Motion of the amino-terminal domain relative to the carboxy 
10 terminal domain is expected to change the distances between residues in different domains. 

The distances calculated in Table 4 help form the model template since these three- 
dimensional positions are taken from the crystal diffraction patterns and help to define a 
conserved shape for the protein kinase family catalytic core. A point of contact is defined 
herein to occur at the invariant residues and is additionally defined as a point of close 
15 spatial approximation between the atoms of the residues within or around the catalytic core 

and the atoms of the affector. These points of contact affect the specificity and the Kd of 
the enzyme/affector interaction. 

The template is best described by Figure 11. The coordinates for the template listed 
in Table 4 and Figure 17 provide the spatial characteristics that permit one of skill in the 
2 0 an to input the template structure into a computer program and perform the invention 

disclosed herein. While the coordinates together define a three-dimensional surface that 
permits visualization of the catalytic site, there are invariant residues that establish 
important foci within the structure. 

Lys 72 is invariable within the catalytic site for the protein kinase family and is an 

2 5 anchor for superimposing other protein kinases onto the template. Asp 166 can additionally 

be a second important anchor. Similarly the other invariant positions likewise have 
importance for fitting other kinases. A combination of the coordinates with the invariant 
residue positions allows important regions within and around the catalytic site to be 
visualized. From a study of the interaction of cAPK with PKI(5-24), important hydrophobic 

3 0 and ionic interactions can be analyzed. When a new enzyme is superimposed onto these 

coordinates these hydrophobic and ionic interactions are assessed with PKI(5-24). It is then 
possible to study what changes can be made to PKI(5-24) to model a new affector, A study 
of the residue sidechains and the charge distribution within the site is used to fine tune the 
new affector. 



BNSOOCID- <WO 9302209A1> 



WO 93/02209 Q (5 PCT/US92/06I37 

-51- 

Any protein kinase having homology in and around the catalytic site with cAPK can 
be used to design specific affector molecules. Hanks et al. provides a list with homologous 
residues highlighted. Many growth factor receptors have protein kinase activities. These 
include but are not limited to platelet-derived growth factor, colony stimulating factor, the 
5 insulin receptor family and epidermal growth factor. Protein kinases are involved in 

hematopoiesis and lymphopoiesis. Some, like myosin light chain kinase, are calcium- 
calmodulin dependent, and further, a variety of protein kinases are oncogenic products. 
These include but are not limited to viral and cellular homologues of src, mos, abl, Neu, Fgr, 
and Yes. Any of these kinases as well as others fitting the characteristics disclosed herein 
10 could be used in this invention to produce specific affector molecules. 

The phosphorylation target sequences are available for a variety of protein kinases. 
These include phosphorylatable amino acids with their surrounding residues. For some 
kinases this will provide a good starting point for inhibitor design. Other protein kinases 
have a regulatory subunit associated with the catalytic subunit in the inactive form. The 
15 binding sequences with the regulatory subunits are other starting points for affector 

molecule design. Additionally, there are a group of protein kinases that have a regulatory 
domain. This domain binds the catalytic site when the enzyme is inactive. Binding of an 
exogenous molecule changes the kinase conformation such that the regulatory domain no 
longer binds. A review by Pearson et al. provides a table of protein kinase phosphorylation 
2 0 site sequences (Methods in Enzymology Vol. 200, 1991 in press). 

Once a template is created there are several options available for designing an 
affector molecule and these were outlined in the section above entitled "Brief Description 
of Affector Design." 

EXAMPLE 6 

25 Inhibitor design for pp60 c ' src withrmf 

PP60 c ' src purification 
pp60 c " src is the proto-oncogene homologue of the src protein kinase from Rous 
Sarcoma Virus. The protein causes unrestrained cell proliferation. In this example, the 
invariant residues for pp60 c ' src are identified with a star and in bold below; 
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VHRDLRAANILVGENLVCKVADFGLARLIEDNEYTAROGAKFPIKWTAPEAALY 
GRFTT 

KSDVWSFGILLTCLTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLH 
5 CQCWR 

KEPEERPTFEYLQAFLEDYFTST 

These residues are incorporated into the appropriate position from the invariant 
10 residues listed in Table 4 using the coordinate set provided in Figure 17. Once the template 

is in place and the catalytic site from pp60 c ' src has been superimposed onto the template, 
it is possible to visualize the catalytic site. The site can additionally be refined using the 
complementary target phosphorylation site for pp60 c * src . 

RLIEDNEY*TARQGAK 
15 * denotes the site of phosphorylation. 

Residues are altered using computer modelling until a fit is achieved for pp60 c * src 
on the template. Thus, residues 184, 166, 172, 220, 208, and 280 from the pp60 c-src 
sequence have positions in space that maintain those distances disclosed in Table 4. Ionic 
and hydrophobic amino acid side chains are matched within the catalytic core with 
20 complementary residues to create a new inhibitor molecule. Recombinant cAPK is then 

mutated to duplicate the three-dimensional structure within the core. Crystals of mutated 
cAPK are analyzed alone or together with a proposed inhibitor. The structure is again 
analyzed in the context of the invariant residues listed in Table 4. 

Positions 52, 72. and 91 are mobile invariant residues whose positions will vary 
25 depending on the quality of inhibitor The distances of these residues are listed in Table 

4 for PKI(5-24) and cAPK. It is anticipated that peptide inhibitors of equal affinity for 
cAPK will have similar distances. Non-peptide inhibitors can be designed that do not 
produce a rotation, or fraction of fit, exactly in the same direction as peptide inhibitors, such 
as PKI(5-24). A comparison of the crystal structure of cAPK and cAPK with PKI(5-24) 
* 0 indicate that positions 52, 72 and 91 rotate 12° toward residues 184, 166, 172, 220, 208 and 

280. This rotation defines a range of peptide inhibitors. Another strong peptide inhibitor 
will similarly produce a 12° rotation toward the six residues listed above while residues 52, 
72 and 91 may have a smaller angle of rotation for weaker peptide inhibitors. 

In addition, there are important points of contact between cAPK and PKX(5-24). 
5 The specific contact amino acids on PKI(5-24) are starred and the corresponding points of 

contact within the catalytic core of cAPK are identified as positions p+ 1, p-2, p-3, p-6 and 
p-1 1. These points of contact are conserved within the catalytic core of all protein kinases 
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and similar points of contact will be readily identifiable to those of skill in the art for other 
protein kinases. A sphere of influence having a radius of llA or less, more preferably 6A 
or less, and extending from the inhibitor around the points of contact at positions p+ 1, p-2, 
p-3, p-6, and p-1 1 can be used to define regions that are critical for inhibitor specificity. 
5 As described above, in connection with Figure 1 1, the points of contact can be used 

to identify the replacements necessary to design appropriate inhibitors or other affectors for 
a new enzyme. Thus, amino acid replacements are used which form appropriate ionic and 
hydrophobic interactions at these points of contact. Hydrogen bonding interactions are also 
preferably used to identify replacements. Of course, the modelling can extend beyond the 
10 identified points of contacts in order to provide still further specificity 

This same type of analysis can be performed with the mutated cAPK that mimics the 
catalytic core of pp60 c - src . Thus, an analysis of the mutant crystals permits one to predict 
the affinity of a given inhibitor. The inhibitor can be further modified to improve the ionic 
and hydrophobic interactions surrounding the points of contact using the spheres of 
influence described above. The angle of rotation of the mobile invariant residues can be 
used to predict whether or not a given peptide inhibitor will be useful. These changes are 
all performed within the constraints of the coordinates of Figure 17. 

A peptide inhibitor that, once modelled has distances similar to Table 4 and meets 
the design criteria described above can be synthesized and tested for function in vitro or in 
20 vivo. 

The coordinates obtained from the binary complex and the resulting template allow 
us for the first time to fully appreciate the complexity and sophistication of the process by 
which a protein kinase recognizes its protein substrate. While peptide analogues provide 
important clues, the diversity of the peptide binding sites and their dispersion over such a 
wide area on the enzyme surface makes it imperative to have structural data on complexes 
of the enzyme with affector molecules. The structure of the binary complex of cAPK with 
PKI(5-24) provides, for the first time, a molecular basis for the rational design of affector 
molecules, both peptide and nonpeptide, that can target specific protein kinases. 
Furthermore, because the basic catalytic core of this enzyme is so conserved in all protein 
kinases, a template based on the crystal structure can also serve as a mold for modelling for 
other protein kinases. 

Although this invention has been described using protein kinases as a model system, 
with cAPK being shown as a specific example of an enzyme for determining the template,' 
the present invention is not intended to be limited to this model. Other changes to the 
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methods described herein will suggest themselves to those of ordinary skill in the an. 
Accordingly, the spirit and scope of the present invention is to be determined with reference 
to the appendant claims. 
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WE CLAIM : 

1. A method of designing a highly specific affector which exerts an effect on the 
activity of a first enzyme, said first enzyme being a member of a class of enzymes having a 
conserved catalytic core, comprising: 
5 identifying a second enzyme that is a member of said class in which a first 

affector can affect the activity of said second enzyme; 

forming a first complex of said first affector and said second enzyme; 
obtaining data regarding the conformation of said second enzyme at sites 
greater than 5 A from the site of catalysis of said second enzyme in said first 
10 complex; 

designing an affector which induces a conformation on said first enzyme at 
sites greater than 5 A from the site of catalysis thereof which is homologous to the 
conformation of the conformation of said second enzyme at homologous sites in said 
first complex, when said affector is formed as a second complex with said first 
15 enzyme; and 

producing said affector. 

2. The method of Claim 1, additionally comprising crystallizing said first 
complex and obtaining X-ray crystallography data therefrom. 

3. The method of Claim 1, wherein the designing step comprises: 

20 identifying a potential affector likely to induce a conformation on the 

catalytic core of said first enzyme which is homologous to the conformation of the 
catalytic core of said second enzyme in said first complex, when said affector is 
formed as a second complex with said first enzyme; and 

determining whether said potential affector induces said conformation 

25 through the use of a technique selected from the group consisting of computer 

modelling, small angle neutron scattering, and circular dichroism. 

4. The method of Claim 3, wherein said potential affector comprises a peptide. 

5. The method of Claim 3, wherein said potential affector comprises a molecule 
selected from the group consisting of nucleotides, polynucleotides, alcohols, polymers, lipids 

3 0 carbohydrates, sugars, native or synthetic molecules, protein and organic compounds and 

combinations thereof 

6. The method of Claim 1, wherein all of the members of said class have related 

functions. 
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7. The method of Claim 1, wherein the catalytic cores of all of the members of 
said class have conserved amino acid residues. 

8. The method of Claim 7, wherein the designing step comprises designing an 
affector having homologous topography and charge fields that complement the catalytic core 

5 of said first enzyme and capable of inducing a conformation wherein the conserved amino 

acid residues of said first enzyme are in homologous locations to said second enzyme in said 
first complex. 

9. The method of Claim 1, wherein each of the affectors is an inhibitor. 

10. The method of Claims 1, wherein each of the affectors is an activator. 

10 11. The method of Claim 1, wherein said first affector comprises all or a portion 

of said first enzyme. 

12. The method of Claim 1, wherein said first complex is a holoenzyme. 

13. The affector produced by the method of Claim 1. 

14. The method of Claim 7, wherein said class of enzymes comprises protein 

15 kinases. 

15. The method of Claim 14, wherein said second en2yme is a viral oncogene 
product or a cellular homologue thereof. 

16. The method of Claim 15, wherein said second enzyme is p60 v-Src from RSV 
or its cellular homologue, pp60 c-src. 

20 17 - The method of Claim 16, wherein said second enzyme comprises cAMP- 

dependent protein kinase. 

18. The method of Claim 2, wherein said second enzyme comprises a native 
mammalian protein kinase. 

19. The method of Claim 2, wherein said second enzyme comprises recombinant 
25 protein kinase. 

20. A method of designing a highly specific affector which exerts an effect on the 
activity of a first enzyme, said first enzyme being a member of a class of enzymes having 
conserved residues at an affector binding site, comprising: 

identifying a second enzyme that is a member of said class in which a first 
30 affector can affect the activity of said second enzyme, said first affector having a 

dissociation constant with said second enzyme of less than 1 \iM; 

forming a first complex of said first affector and said second enzyme; 
obtaining data regarding the conformation of the affector binding site of said 
second enzyme in said first complex; 
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designing an affector which induces a conformation on the affector binding 
site of said first enzyme which is homologous to the conformation of the affector 
binding site of said second enzyme in said first complex, when said affector is 
formed as a second complex with said first enzyme; and 
5 producing said affector. 

21. The method of Claim 20, wherein said class of enzymes have a nucleotide 
binding site and each of said affectors is capable of binding to said nucleotide binding site. 

22. A method of designing a highly specific affector which exerts an effect on the 
activity of a first enzyme, said first enzyme being a member of a class of enzymes having a 

10 conserved catalytic core, comprising: 

identifying a second enzyme that is a member of said class in which a first 
affector can affect the activity of said second enzyme; 

forming a first complex of said first affector and said second enzyme, said 
first complex having at least three points of contact between said first affector and 
15 second enzyme; 

obtaining data regarding the conformation of the catalytic core of said second 
enzyme in said first complex; 

designing an affector which induces a conformation on the catalytic core of 
said first enzyme which is homologous to the conformation of the catalytic core of 
20 said second enzyme in said first complex, when said affector is formed as a second 

complex with said first enzyme; and 

producing said affector 
23. A crystallized protein kinase/affector complex having stable decay 
characteristics over 15 minutes. 

25 24. A crystallized protein kinase/affector complex having a Bragg spacing 

diffraction limit of less than 4A. 

25. The crystallized protein kinase of Claim 24 having stable decay characteristics 
over 15 minutes, 

26. A crystallized complex of a cAMP-dependent protein kinase and an inhibitor 

3 0 thereof 

27. Use of the crystallized complex of Claim 26 in an X-ray ciystallography 
procedure to produce data regarding the three dimensional structure of said cAMP- 
dependent protein kinase in said complex. 
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28. Use of the data produced by Claim 27 for designing an inhibitor of a second 
protein kinase which will induce a similar three dimensional structure on the active site of 
said second protein kinase as the three dimensional structure of said cAMP-dependent 
protein kinase in said complex. 
5 29. An inhibitor designed by Claim 28. 

30. A method of preparing a highly specific affector of a first enzyme, said first 
enzyme being a member of a class of enzymes having a conserved catalytic core with a 
plurality of invariant residues, comprising: 

a. identifying a second enzyme that is a member of said class and having 
10 a known affector thereof; 

b. forming a first complex of said second enzyme and said known 
affector; 

c. obtaining data regarding the three dimensional coordinates of said 
invariant residues in said first complex, said coordinates forming a template; 

15 d - generating a model wherein said first enzyme is in a conformation in 

which said invariant residues are in substantially the same conformation as in said 
template; 

e. identifying a change in the variable residues in the catalytic core of 
said first enzyme in the conformation of step (d) when compared to the variable 

2 0 residues in the catalytic core of said second enzyme in the conformation of step (b); 

f. preparing a modified form of said second enzyme, wherein the 
modified second enzyme includes the non-conserved change identified in step (e); 

g. designing an affector which computer modelling indicates will induce 
a conformation on the catalytic core of said first enzyme in which said invariant 

25 residues are in substantially the same coordinates as the coordinates of said 

template, when said first enzyme is formed as a second complex with the afifector 
designed in this step; and 

h. producing said affector. 

31. The method of Claim 30 wherein said change is a non-conserved change in 
30 the variable residues. 

32. The method of Claim 30, additionally comprising: 

i. forming a third complex of said modified second enzyme and an 
affector capable of binding thereto; 
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j. obtaining data regarding the three dimensional coordinates of the 
invariant residues in said third complex; and 

k. using the data obtained in step (i) to design an affector which 
computer modelling indicates will induce a conformation on the catalytic core of said 
5 first enzyme in which said invariant residues are closer to the coordinates of said 

template than the conformation induced by the affector designed in step (g), when 
said first enzyme is formed as a fourth complex with the affector designed in this 
step. 

33. The method of Claim 32 wherein the affector of step (i) is the known 
10 affector of step (a). 

34. The method of Claim 32, additionally comprising modifying the computer 
modelling used in step (g) in light of the data of step 0), prior to performing step (k). 

35. The method of Claim 30, additionally comprising obtaining amino acid 
sequence data relating to the catalytic cores of the first and second enzymes. 

15 36 ' The method of Claim 30 wherein step (f) comprises site directed mutagenesis 

of a recombinantly produced second enzyme. 

37. The method of Claim 30, wherein the coordinates of said template are 
substantially as shown in Figure 17. 

38. The method of Claim 30, wherein each of the affectors is an inhibitor. 

20 39 - . The method of CIaim 30. wherein said template includes coordinates 

separated by the distances substantially as shown in Table 4. 

40. An affector prepared by the method of Claim 39. 

41. A pharmaceutical composition comprising the affector of Claim 40. 

42. A method of designing a specific inhibitor for a protein kinase, comprising: 
obtaining data regarding the three-dimensional structure of a first protein 

kinase; 

using said data in the design of an inhibitor for a second, different, protein 
kinase; and 

producing said inhibitor. 

30 43 - ^ method of Claim 42 wherein said first protein kinase is cAMP dependent 

protein kinase or an analogue thereof. 

44. The method of Claim 43, wherein the obtaining step comprises crystallizing 
a complex of cAMP dependent protein kinase and an inhibitor thereof. 



25 
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45. The method of Claim 44, wherein the obtaining step additionally comprises 
obtaining X-ray crystallography data on the crystallized complex obtained from the 
crystallizing step, 

46. The method of Claim 44, additionally comprising obtaining information 
5 regarding the structure of the crystallized complex obtained in the crystallizing step through 

circular dichroism, small angle neutron scattering, or other chemical procedures. 

47. Use of the data of Figure 17 or of Table 4 in the design of an affector for 
a protein kinase. 

48. A method of preparing a highly specific inhibitor of a first enzyme, said first 
D enzyme being a member of a class of enzymes having a conserved catalytic core with a 

plurality of invariant residues, comprising: 

a. identifying a second enzyme that is a member of said class and having 
a known first inhibitor thereof; 

b. forming a first complex of said second en2yme and said first inhibitor; 
> c. obtaining data regarding the three dimensional coordinates of said 

invariant residues in said first complex; 

d. designing a second inhibitor which computer modelling indicates will 
induce a conformation on the catalytic core of said first enzyme in which said 
invariant residues are in substantially the same coordinates as the coordinates 
obtained in step (c), when said first enzyme is formed as a second complex with said 
second inhibitor; 

e. preparing said second inhibitor; 

f. forming a third complex of said second inhibitor and a third enzyme 
complexable therewith, said thkd enzyme having a plurality of said invariant 
residues; 

g. obtaining data regarding the three dimensional coordinates of said 
invariant residues in said third complex; 

h. using the data obtained from step (g) to design a third inhibitor which 
computer modelling indicates will induce a conformation on the catalytic core of said 
first enzyme closer to that in which said invariant residues are in substantially the 
same coordinates as the coordinates obtained in step (c) than induced by the second 
inhibitor, when said first enzyme is formed as a fourth complex with said third 
inhibitor; and 

i. producing said third inhibitor. 
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49. The method of Claim 48, wherein said first inhibitor is an inhibitory domain 
of said second enzyme. 

50. The method of Claim 48 t wherein said third enzyme comprises at least 5 
invariant residues. 

5 51. The method of Claim 48, wherein said third enzyme is a naturally occurring 

enzyme. 

52. The method of Claim 48, wherein said third enzyme is a mutant enzyme. 

53. A method of determining the efficacy of a first affector in affecting a first 
enzyme that is a member of a class of enzymes having a plurality of invariant residues 

10 among the members of said class, comprising: 

determining the three dimensional coordinates of the invariant residues of 
a second enzyme in a first conformation wherein said second enzyme is in a complex 
with a second affector that is a strong affector of said enzyme; 

determining the three dimensional coordinates of the invariant residues of 
15 said second enzyme in a second conformation wherein said enzyme is in a 

conformation other than said first conformation; 

identifying the mobile invariant residues of said enzyme, said mobile 
invariant residues being those invariant residues at coordinates substantially different 
in said first conformation than in said second conformation; 
2 0 determining the three dimensional coordinates of the mobile invariant 

residues of said first enzyme when said first enzyme is in a conformation wherein 
said first enzyme is in a complex with said first affector; 

comparing the three dimensional coordinates of the mobile invariant residues 
of said first enzyme in said conformation with the coordinates of the mobile 

2 5 invariant residues of said enzyme in said first conformation, wherein relatively small 

differences in the coordinates indicate relatively high efficacy of said first affector. 

54. The method of Claim 53, wherein the step of determining the coordinates 
of said first enzyme in said conformation is performed using computer modelling of said 
conformation. 

3 0 55. The method of Claim 53, wherein the steps of determining the first and 

second conformations comprise obtaining X-ray crystallographic data of said enzyme. 

56. The method of Claim 53, wherein said second conformation is a 
conformation produced by a ternary complex. 
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57. The method of Claim 56, wherein said ternary complex comprises a protein 
kinase, a nucleotide and an affector. 

58. The method of Claim 53, wherein said second conformation is a 
conformation produced by said second enzyme not complexed with a ligand. 

5 59. The method of Claim 53, wherein said second en2yme is the same enzyme 

as said first enzyme. 

60. A method of designing specific inhibitors of a first protein kinase having a 
template defined by a plurality of invariant residues present in substantially all protein 
kinases, comprising: 

1° obtaining data regarding the three dimensional coordinates of the invariant 

residues of a second protein kinase and of the points of contact between said second 
protein kinase and a known inhibitor thereof, said coordinates being obtained when 
said second protein kinase is formed as a complex with said known inhibitor; 

generating a model of said first protein complex wherein said template is 
15 defined by the positions of said invariant residues in said complex; 

examining the amino acid residues present in said first protein kinase at 
positions corresponding to the points of contact in said complex; 

designing an inhibitor of said first protein kinase capable of forming ionic 
and hydrophobic interactions with said amino acid residues; and 
2 0 producing said inhibitor of said first protein kinase. 

61. The method of Claim 60, wherein said second protein kinase is cAMP 
dependent protein kinase. 

62. The method of Claim 61, wherein said known inhibitor is PKI(5-24). 

63. The method of Claim 62, wherein the points of contact in said complex 
comprise positions corresponding to the positions selected from the group consisting of P+ 1, 
P-2, P-3, P-6 and P-ll along said known inhibitor. 

64. The method of Claim 60, wherein the positions corresponding to the points 
of contact in the examining step comprise positions within a sphere having a radius of 11 
A from the coordinates of the point of contact obtained in the obtaining step. 

3 0 65 * The method of Claim 64, wherein the positions corresponding to the points 

of contact in the examining step comprise positions within a sphere having a radius of 6 A 
from the coordinates of the point of contact obtained in the obtaining step. 
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66. The method of Claim 60, wherein the designing step additionally comprises 
designing said inhibitor to form appropriate hydrogen bonding with said amino acid 
residues. 
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FIGURE 10B 
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©loiorarore 



oiaiaiorororo 



CorMnsui Peptide 



turn Ext«r>dad ContormaUon 
■> ^ ► < > 



Km s 16.0 jiM 
KJ = 490pM 



S«r P«ptJd«: 
Ala Ptptldc: 

Ws2.3nM 

Nomanetitura: p M p« p* 4 pVV' P* P* P* P* P* P* P* P* P* p* P p" p* p- 

Hgh Affinity Binding Poglon 
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TABLE 



POINTS OF COWTAPT 
P+l 



P-2 

P-3 

P-6 
P-ll 



POSITTOff 

197 

198 

199 

200 

201 

202 

203 

204 

205 

170 
230 

127 
331 

203 

235 
236 
237 
238 
239 



CAPK 

Thr 

Leu 

Cys 

Gly 

Thr 

Pro 

Glu 

Tyr 

Leu 

Glu 
Glu 

Glu 
Glu 

Leu 

Tyr 
Pro 
Pro 
Phe 
Phe 



Asp 
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TABLE 4 



Angstroms I AC on 1 I Atom 2 

apart I I 

5.29 ASP 1S4 CA GLY 186 CA 

5.73 GLU 91 CA GLY 186 CA 

6.46 ASN 171 CA AS? 184 CA 

7.41 ASN 171 CA ASP 166 CA 

7.61 AS? 166 CA GLY 1S6 CA 

7.87 ASP 184 CA GLU 91 CA 

8.20 ASP 166 CA ASP 184 CA 

9.20 AS? 184 CA LYS 72 CA 

9.90 GLY 52 CA LYS 72 CA 

10.15 ASN 171 CA GLY 186 CA 

10.29 ASP 184 CA GLY 52 CA 

10.53 GLY 52 CA GLY 186 CA 
10.78 ASN 171 CA GLY 52 CA 
10.91 GLY 186 CA LYS 72 CA 
11.29 GLU 91 CA LYS 72 CA 
11.80 ARG 2 80 CA GLU 208 CA 
12.27 AS? 166 CA GLU 91 CA 
12.65 ASP 155 CA GLY 52 CA 
13.52 ASN 171 CA LYS 72 CA 
14.07 ASN 171 CA GLU 91 CA 
15.02 GLU 91 CA GLY 52 CA 
15.07 ASP 166 CA GLU 208 CA 

16.54 AS? 165 CA LYS 72 CA 
18.58 ARG 2 80 CA ASP 165 CA 
19.99 GLU 208 CA GLY 186 CA 
22.00 ASN 171 CA GLU 208 CA 

22.82 ASP 184 CA GLU 208 CA 
23.37 GLU 91 CA GLU 208 CA 
23.49 ARG 280 CA " ~N 171 CA 
24.87 ARG 280 CA GLY 186 CA 
25.18 CLO 208 CA GLY 52 CA 
25.61 ARG 280 CA ASP 184 CA 
27.34 ARG 280 CA GLU 91 CA 
30.53 GLU 208 CA LYS 72 CA 

30.83 ARG 280 CA GLY 52 CA 
34.67 ARG 280 CA LYS 72 CA 
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